2 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 2 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 9 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation 10 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation 12 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 16 tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases 18 Aminotransferase class-III 25 SH2 domain 25 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 25 SH2 domain 25 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 26 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 26 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 27 SH2 domain 27 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 27 SH2 domain 27 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 27 SH2 domain 27 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 28 Glycosyltransferase family 6 29 C2 domain 29 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 29 C2 domain 29 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 31 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 32 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 38 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 40 Amiloride-sensitive sodium channel 41 Amiloride-sensitive sodium channel 41 Amiloride-sensitive sodium channel 43 Carboxylesterase 43 Carboxylesterase 51 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 51 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 53 Histidine acid phosphatase 54 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 55 Histidine acid phosphatase 58 Actin 59 Actin 60 Actin 70 Actin 71 Actin 72 Actin 88 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 90 Protein kinase domain 90 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 91 Protein kinase domain 91 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 91 Protein kinase domain 91 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 91 Protein kinase domain 91 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 92 Protein kinase domain 92 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 93 Protein kinase domain 93 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 94 Protein kinase domain 94 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 95 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 97 Acylphosphatase 98 Acylphosphatase 100 Adenosine/AMP deaminase 101 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 101 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 102 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 103 Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r 103 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 103 Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r 103 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 103 Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r 103 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 104 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 104 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 104 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 105 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 107 Adenylate and Guanylate cyclase catalytic domain 107 Adenylate and Guanylate cyclase catalytic domain 108 Adenylate and Guanylate cyclase catalytic domain 109 Adenylate and Guanylate cyclase catalytic domain 112 Adenylate and Guanylate cyclase catalytic domain 112 Adenylate and Guanylate cyclase catalytic domain 113 Adenylate and Guanylate cyclase catalytic domain 114 Adenylate and Guanylate cyclase catalytic domain 115 Adenylate and Guanylate cyclase catalytic domain 116 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 117 7 transmembrane receptor (Secretin family) 118 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 118 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 118 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 118 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 119 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 119 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 119 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 119 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 119 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 119 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 120 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 120 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 123 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 132 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es 132 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es 133 Adrenomedullin 134 7 transmembrane receptor (rhodopsin family) 135 7 transmembrane receptor (rhodopsin family) 136 7 transmembrane receptor (rhodopsin family) 140 7 transmembrane receptor (rhodopsin family) 141 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi 142 Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc 142 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 142 pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini 143 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 143 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 146 7 transmembrane receptor (rhodopsin family) 147 7 transmembrane receptor (rhodopsin family) 148 7 transmembrane receptor (rhodopsin family) 148 7 transmembrane receptor (rhodopsin family) 148 7 transmembrane receptor (rhodopsin family) 148 7 transmembrane receptor (rhodopsin family) 150 7 transmembrane receptor (rhodopsin family) 151 7 transmembrane receptor (rhodopsin family) 152 7 transmembrane receptor (rhodopsin family) 153 7 transmembrane receptor (rhodopsin family) 154 7 transmembrane receptor (rhodopsin family) 155 7 transmembrane receptor (rhodopsin family) 156 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 157 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 159 Adenylosuccinate synthetase 160 Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site 160 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 160 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 160 Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site 160 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 160 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 161 Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site 161 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 161 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 162 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 162 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 162 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 162 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 163 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 163 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 164 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 164 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 165 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 166 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 173 Serum albumin family 174 Serum albumin family 175 Asparaginase 176 Extracellular link domain 176 Extracellular link domain 176 Sushi domain (SCR repeat) 176 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 182 Delta serrate ligand 183 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 185 7 transmembrane receptor (rhodopsin family) 185 7 transmembrane receptor (rhodopsin family) 185 7 transmembrane receptor (rhodopsin family) 185 7 transmembrane receptor (rhodopsin family) 185 7 transmembrane receptor (rhodopsin family) 186 7 transmembrane receptor (rhodopsin family) 187 7 transmembrane receptor (rhodopsin family) 190 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 191 S-adenosyl-L-homocysteine hydrolase 196 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 196 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 197 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 202 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 203 Adenylate kinase 204 Adenylate kinase 204 Adenylate kinase 204 Adenylate kinase 205 Adenylate kinase 210 Delta-aminolevulinic acid dehydratase 213 Serum albumin family 216 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 217 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 218 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 219 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 220 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 221 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 222 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 223 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 224 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 226 Fructose-bisphosphate aldolase class-I 229 Fructose-bisphosphate aldolase class-I 230 Fructose-bisphosphate aldolase class-I 231 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 238 MAM domain. An extracellular domain found in many receptors 239 Lipoxygenase 239 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 240 Lipoxygenase 241 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 242 Lipoxygenase 242 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 244 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 244 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 246 Lipoxygenase 247 Lipoxygenase 247 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 248 Alkaline phosphatase 249 Alkaline phosphatase 250 Alkaline phosphatase 251 Alkaline phosphatase 257 Homeobox domain 259 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 262 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermi 265 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m 265 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m 265 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m 266 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m 267 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 269 Protein kinase domain 269 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 270 Adenosine/AMP deaminase 271 Adenosine/AMP deaminase 272 Adenosine/AMP deaminase 273 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 273 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 274 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 275 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 276 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe 276 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st 279 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe 279 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st 280 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe 280 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st 283 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 285 Uncharacterized ACR, COG1579 286 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 286 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 286 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 286 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 287 Death domain 287 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 287 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 287 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 287 Death domain 287 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 287 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 288 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 288 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 290 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 291 Mitochondrial carrier protein 292 Mitochondrial carrier protein 293 Mitochondrial carrier protein 301 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 302 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 306 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 307 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 308 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 309 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 309 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 310 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 310 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 311 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 311 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 311 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 312 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 313 Lipase/Acylhydrolase with GDSL-like motif 314 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 314 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 314 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 314 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 314 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 314 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 317 NB-ARC domain 317 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 317 NB-ARC domain 317 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 317 NB-ARC domain 317 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 317 NB-ARC domain 317 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 317 NB-ARC domain 317 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 318 NUDIX domain 318 NUDIX domain 318 NUDIX domain 320 Phosphotyrosine interaction domain (PTB/PID) 320 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 321 Phosphotyrosine interaction domain (PTB/PID) 321 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 322 Phosphotyrosine interaction domain (PTB/PID) 322 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 322 Phosphotyrosine interaction domain (PTB/PID) 322 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 323 Phosphotyrosine interaction domain (PTB/PID) 325 Pentaxin family. Pentaxins are also known as pentraxins 326 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 326 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 326 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 326 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 326 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 329 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 330 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 333 Amyloid A4 extracellular domain 334 Amyloid A4 extracellular domain 335 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 337 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 338 Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitell 339 Cytidine and deoxycytidylate deaminase zinc-binding region 343 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 348 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 350 Sushi domain (SCR repeat) 351 Amyloid A4 extracellular domain 351 Beta-amyloid peptide (beta-APP) 355 TNFR/NGFR cysteine-rich region 355 TNFR/NGFR cysteine-rich region 355 TNFR/NGFR cysteine-rich region 357 Uncharacterized ACR, COG1579 358 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 359 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 361 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 361 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 362 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 363 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 363 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 367 Androgen receptor 367 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 367 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 369 Protein kinase domain 369 Raf-like Ras-binding domain 369 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 372 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 373 ADP-ribosylation factor family 373 ADP-ribosylation factor family 373 ADP-ribosylation factor family 379 ADP-ribosylation factor family 383 Arginase family 384 Arginase family 393 SH3 domain 393 RhoGAP domain 393 Fes/CIP4 homology domain 394 FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions 396 RHO protein GDP dissociation inhibitor 397 RHO protein GDP dissociation inhibitor 398 RHO protein GDP dissociation inhibitor 400 ADP-ribosylation factor family 401 Homeobox domain 402 ADP-ribosylation factor family 403 ADP-ribosylation factor family 405 Helix-loop-helix DNA-binding domain 405 Helix-loop-helix DNA-binding domain 405 Helix-loop-helix DNA-binding domain 406 Helix-loop-helix DNA-binding domain 407 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 407 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 408 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 408 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 408 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 408 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 409 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 409 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 410 Sulfatase 411 Sulfatase 412 Sulfatase 414 Sulfatase 414 Sulfatase 415 Sulfatase 416 Sulfatase 417 NAD:arginine ADP-ribosyltransferase 419 NAD:arginine ADP-ribosyltransferase 420 NAD:arginine ADP-ribosyltransferase 427 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 427 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 432 Hepatic lectin, N-terminal domain 432 Lectin C-type domain. This family includes both long and short form C-type 433 Hepatic lectin, N-terminal domain 433 Lectin C-type domain. This family includes both long and short form C-type 433 Hepatic lectin, N-terminal domain 433 Lectin C-type domain. This family includes both long and short form C-type 433 Hepatic lectin, N-terminal domain 433 Lectin C-type domain. This family includes both long and short form C-type 433 Hepatic lectin, N-terminal domain 433 Lectin C-type domain. This family includes both long and short form C-type 438 O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine 439 Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell 445 Arginosuccinate synthase. This family contains a PP-loop motif 445 Arginosuccinate synthase. This family contains a PP-loop motif 462 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 463 Homeobox domain 466 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 466 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 468 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 468 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 469 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 472 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 472 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 473 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 473 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 480 E1-E2 ATPase 480 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 480 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 480 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 481 Sodium / potassium ATPase beta chain 482 Sodium / potassium ATPase beta chain 483 Sodium / potassium ATPase beta chain 486 ATP1G1/PLM/MAT8 family 486 ATP1G1/PLM/MAT8 family 496 Sodium / potassium ATPase beta chain 501 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 509 ATP synthase 516 ATP synthase subunit C 517 ATP synthase subunit C 518 ATP synthase subunit C 527 ATP synthase subunit C 528 V-ATPase subunit C 529 ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit 534 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 534 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 535 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 538 Heavy-metal-associated domain 539 ATP synthase delta (OSCP) subunit. The ATP D subunit from E. coli is the same as the OSCP subunit which is this family. The ATP D subunit from metazoa are found in family pfam00401 540 Heavy-metal-associated domain 545 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 546 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 546 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 546 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 546 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 546 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 546 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 547 Kinesin motor domain 547 PH domain. PH stands for pleckstrin homology 547 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 551 Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5 552 7 transmembrane receptor (rhodopsin family) 553 7 transmembrane receptor (rhodopsin family) 554 7 transmembrane receptor (rhodopsin family) 558 Fibronectin type III domain 558 Fibronectin type III domain 563 Class I Histocompatibility antigen, domains alpha 1 and 2 571 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 573 BAG domain. Domain present in Hsp70 regulators 575 7 transmembrane receptor (Secretin family) 575 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 575 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 576 7 transmembrane receptor (Secretin family) 576 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 576 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 577 7 transmembrane receptor (Secretin family) 577 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 577 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 578 Apoptosis regulator proteins, Bcl-2 family 579 Homeobox domain 580 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 581 Apoptosis regulator proteins, Bcl-2 family 581 Apoptosis regulator proteins, Bcl-2 family 581 Apoptosis regulator proteins, Bcl-2 family 586 Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci 587 Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci 590 Carboxylesterase 593 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 595 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 596 Apoptosis regulator proteins, Bcl-2 family 596 Apoptosis regulator proteins, Bcl-2 family 597 Apoptosis regulator proteins, Bcl-2 family 598 Apoptosis regulator proteins, Bcl-2 family 598 Apoptosis regulator proteins, Bcl-2 family 599 Apoptosis regulator proteins, Bcl-2 family 604 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 604 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 610 Cyclic nucleotide-binding domain 610 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 611 7 transmembrane receptor (rhodopsin family) 613 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 613 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 613 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 613 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 622 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 623 7 transmembrane receptor (rhodopsin family) 624 7 transmembrane receptor (rhodopsin family) 627 Nerve growth factor family 627 Nerve growth factor family 627 Nerve growth factor family 627 Nerve growth factor family 627 Nerve growth factor family 627 Nerve growth factor family 629 von Willebrand factor type A domain 631 Intermediate filament protein 632 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 635 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 640 SH2 domain 640 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 642 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 643 7 transmembrane receptor (rhodopsin family) 643 7 transmembrane receptor (rhodopsin family) 644 Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD 648 Zinc finger, C3HC4 type (RING finger) 649 CUB domain 649 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 649 CUB domain 649 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 649 CUB domain 649 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 649 CUB domain 649 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 649 CUB domain 649 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 649 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 650 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 652 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 652 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 652 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 653 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 654 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 655 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 656 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 657 Protein kinase domain 657 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 658 Protein kinase domain 658 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 659 Protein kinase domain 659 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 659 Protein kinase domain 659 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 662 Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway 662 Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway 662 Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway 662 Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway 666 Apoptosis regulator proteins, Bcl-2 family 667 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 667 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 667 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 671 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 672 Zinc finger, C3HC4 type (RING finger) 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 672 Zinc finger, C3HC4 type (RING finger) 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 672 Zinc finger, C3HC4 type (RING finger) 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 672 Zinc finger, C3HC4 type (RING finger) 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 672 Zinc finger, C3HC4 type (RING finger) 672 Zinc finger, C3HC4 type (RING finger) 672 Zinc finger, C3HC4 type (RING finger) 672 Zinc finger, C3HC4 type (RING finger) 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 672 Zinc finger, C3HC4 type (RING finger) 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 672 Zinc finger, C3HC4 type (RING finger) 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 672 Zinc finger, C3HC4 type (RING finger) 672 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 673 Protein kinase domain 673 Raf-like Ras-binding domain 673 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 675 BRCA2 repeat. The alignment covers only the most conserved region of the repeat 680 7 transmembrane receptor (rhodopsin family) 683 ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion 686 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 688 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 694 BTG1 family. A novel family of anti-proliferative proteins 695 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 696 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 706 TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermedia 708 Mitochondrial glycoprotein. This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q. It is thought to be involved in mitochondrial oxidative phosphorylation and in nucleus-mitochondrion inte 710 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 712 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 712 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 713 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 715 CUB domain 715 Sushi domain (SCR repeat) 716 CUB domain 716 Sushi domain (SCR repeat) 717 Sushi domain (SCR repeat) 717 von Willebrand factor type A domain 718 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 718 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 718 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 718 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 719 7 transmembrane receptor (rhodopsin family) 720 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 720 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 720 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 720 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 721 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 721 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 721 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 721 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 722 Sushi domain (SCR repeat) 725 Sushi domain (SCR repeat) 726 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 727 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 727 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 727 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 727 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 728 7 transmembrane receptor (rhodopsin family) 729 Sushi domain (SCR repeat) 729 Thrombospondin type 1 domain 729 Low-density lipoprotein receptor domain class A 729 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 730 Sushi domain (SCR repeat) 730 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 731 Low-density lipoprotein receptor domain class A 731 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 732 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 733 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 735 Low-density lipoprotein receptor domain class A 735 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 738 Dor1-like family. Dor1 is involved in vesicle targeting to the yeast Golgi apparatus and complexes with a number of other trafficking proteins, which include Sec34 and Sec35 744 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 747 Lipase (class 3) 759 Eukaryotic-type carbonic anhydrase 760 Eukaryotic-type carbonic anhydrase 761 Eukaryotic-type carbonic anhydrase 762 Eukaryotic-type carbonic anhydrase 763 Eukaryotic-type carbonic anhydrase 765 Eukaryotic-type carbonic anhydrase 766 Eukaryotic-type carbonic anhydrase 767 Eukaryotic-type carbonic anhydrase 768 Eukaryotic-type carbonic anhydrase 770 Eukaryotic-type carbonic anhydrase 771 Eukaryotic-type carbonic anhydrase 773 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 773 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 774 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 775 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 776 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 777 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 778 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 779 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 780 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 780 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 780 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 781 Cache domain 782 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 783 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 784 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 785 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 786 PMP-22/EMP/MP20/Claudin family 788 Mitochondrial carrier protein 790 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 790 MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also 790 Carbamoyl-phosphate synthetase large chain, oligomerisation domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. The carbamoyl-phosphate synthase (CPS) enzyme in proka 795 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 799 7 transmembrane receptor (Secretin family) 799 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 800 Caldesmon 800 Caldesmon 800 Caldesmon 800 Caldesmon 800 Caldesmon 811 Calreticulin family 820 Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins 821 Calreticulin family 823 Calpain family cysteine protease 823 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 824 Calpain family cysteine protease 824 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 825 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 825 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 825 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 825 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 827 C2 domain 827 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 829 F-actin capping protein alpha subunit 830 F-actin capping protein alpha subunit 831 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 831 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 831 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 831 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 832 F-actin capping protein, beta subunit 834 ICE-like protease (caspase) p10 domain 834 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 834 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 834 ICE-like protease (caspase) p10 domain 834 ICE-like protease (caspase) p10 domain 835 ICE-like protease (caspase) p20 domain 835 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 835 ICE-like protease (caspase) p10 domain 835 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 835 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 835 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 837 ICE-like protease (caspase) p10 domain 837 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 837 ICE-like protease (caspase) p10 domain 837 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 837 ICE-like protease (caspase) p10 domain 838 ICE-like protease (caspase) p10 domain 838 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 839 ICE-like protease (caspase) p10 domain 840 ICE-like protease (caspase) p10 domain 840 ICE-like protease (caspase) p20 domain 841 Death effector domain 841 Death effector domain 841 Death effector domain 841 Death effector domain 841 Death effector domain 842 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 842 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 843 Death effector domain 843 ICE-like protease (caspase) p10 domain 843 Death effector domain 843 ICE-like protease (caspase) p10 domain 843 Death effector domain 843 Death effector domain 843 ICE-like protease (caspase) p10 domain 844 Calsequestrin 845 Calsequestrin 846 7 transmembrane receptor (metabotropic glutamate family) 846 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 847 Catalase 857 Caveolin 858 Caveolin 859 Caveolin 859 Caveolin 860 Runt domain 861 Runt domain 862 MYND finger 862 MYND finger 862 MYND finger 862 MYND finger 863 MYND finger 863 MYND finger 864 Runt domain 865 Core binding factor beta subunit. Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances DNA-binding ability of the alpha subunit in vitro, and has b 865 Core binding factor beta subunit. Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances DNA-binding ability of the alpha subunit in vitro, and has b 866 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 867 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 867 CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co 867 CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser 868 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 868 CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co 868 CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser 871 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 873 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 874 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 881 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 885 Gastrin/cholecystokinin family 886 7 transmembrane receptor (rhodopsin family) 887 7 transmembrane receptor (rhodopsin family) 890 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 894 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 896 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 898 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi 898 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 898 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi 898 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 899 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 900 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 901 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 908 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 909 Class I Histocompatibility antigen, domains alpha 1 and 2 921 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 928 Tetraspanin family 931 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 931 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 931 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 932 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 944 TNF(Tumor Necrosis Factor) family 948 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 949 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 950 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 951 Tetraspanin family 952 ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion 953 GDA1/CD39 (nucleoside phosphatase) family 954 GDA1/CD39 (nucleoside phosphatase) family 955 GDA1/CD39 (nucleoside phosphatase) family 956 GDA1/CD39 (nucleoside phosphatase) family 957 GDA1/CD39 (nucleoside phosphatase) family 959 TNF(Tumor Necrosis Factor) family 963 Tetraspanin family 966 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 967 Tetraspanin family 968 Lysosome-associated membrane glycoprotein (Lamp) 970 TNF(Tumor Necrosis Factor) family 975 Tetraspanin family 976 7 transmembrane receptor (Secretin family) 976 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 976 7 transmembrane receptor (Secretin family) 976 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 977 Tetraspanin family 977 Tetraspanin family 987 Beige/BEACH domain 989 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 999 Cadherin domain 999 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1000 Cadherin domain 1000 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1001 Cadherin domain 1001 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1002 Cadherin domain 1002 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1003 Cadherin domain 1003 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1004 Cadherin domain 1004 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1005 Cadherin domain 1005 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1005 Cadherin domain 1005 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1006 Cadherin domain 1006 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1007 Cadherin domain 1007 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1008 Cadherin domain 1008 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1009 Cadherin domain 1009 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1009 Cadherin domain 1010 Cadherin domain 1010 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1012 Cadherin domain 1013 Cadherin domain 1013 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1014 Cadherin domain 1015 Cadherin domain 1016 Cadherin domain 1016 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1020 Eukaryotic protein kinase domain 1026 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 1026 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 1027 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 1028 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 1040 Phosphatidate cytidylyltransferase 1044 Homeobox domain 1045 Homeobox domain 1046 Homeobox domain 1047 Calreticulin family 1056 Carboxylesterase 1056 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 1059 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 1066 Carboxylesterase 1075 Papain family cysteine protease 1081 Glycoprotein hormone 1103 Choline/Carnitine o-acyltransferase 1103 Choline/Carnitine o-acyltransferase 1103 Choline/Carnitine o-acyltransferase 1103 Choline/Carnitine o-acyltransferase 1105 'chromo' (CHRromatin Organization MOdifier) domain 1105 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 1106 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 1108 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 1109 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 1113 Granin (chromogranin or secretogranin) 1114 Granin (chromogranin or secretogranin) 1119 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 1120 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 1121 GDP dissociation inhibitor 1122 GDP dissociation inhibitor 1123 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 1123 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 1124 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 1124 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 1128 7 transmembrane receptor (rhodopsin family) 1129 7 transmembrane receptor (rhodopsin family) 1130 Perilipin family 1130 Beige/BEACH domain 1130 WD domain, G-beta repeat 1131 7 transmembrane receptor (rhodopsin family) 1132 7 transmembrane receptor (rhodopsin family) 1133 7 transmembrane receptor (rhodopsin family) 1134 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1134 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1135 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1135 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1136 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1136 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1137 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1137 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1138 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1138 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1139 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1139 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1140 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1140 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1141 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1141 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1142 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1142 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1143 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1143 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1144 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1144 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1145 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1145 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1146 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 1146 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 1149 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 1152 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 1152 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 1155 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 1158 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 1158 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 1159 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 1159 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 1160 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 1160 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 1163 Cyclin-dependent kinase regulatory subunit 1164 Cyclin-dependent kinase regulatory subunit 1173 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 1174 Clathrin adaptor complex small chain 1174 Clathrin adaptor complex small chain 1175 Clathrin adaptor complex small chain 1175 Clathrin adaptor complex small chain 1178 Galactoside-binding lectin 1180 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1181 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1182 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1183 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1184 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1185 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1185 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1185 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1185 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1186 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1186 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate de 1187 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1188 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 1191 Clusterin 1201 CLN3 protein. This is a family of proteins from the CLN3 gene. A missense mutation of glutamic acid (E) to lysine (K) at position 295 in the human protein has been implicated in Juvenile neuronal ceroid lipofuscinosis (Batten disease) 1207 Nucleotide-sensitive chloride conductance regulator (ICln) 1208 Colipase, C-terminal domain. SCOP reports duplication of common fold with Colipase N-terminal domain 1211 Clathrin light chain 1211 Clathrin light chain 1212 Clathrin light chain 1212 Clathrin light chain 1213 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 1213 Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each le 1230 7 transmembrane receptor (rhodopsin family) 1231 7 transmembrane receptor (rhodopsin family) 1231 7 transmembrane receptor (rhodopsin family) 1232 7 transmembrane receptor (rhodopsin family) 1232 7 transmembrane receptor (rhodopsin family) 1233 7 transmembrane receptor (rhodopsin family) 1234 7 transmembrane receptor (rhodopsin family) 1235 7 transmembrane receptor (rhodopsin family) 1235 7 transmembrane receptor (rhodopsin family) 1236 7 transmembrane receptor (rhodopsin family) 1237 7 transmembrane receptor (rhodopsin family) 1238 7 transmembrane receptor (rhodopsin family) 1240 7 transmembrane receptor (rhodopsin family) 1241 7 transmembrane receptor (rhodopsin family) 1258 Cyclic nucleotide-binding domain 1258 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 1259 Cyclic nucleotide-binding domain 1259 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 1261 Cyclic nucleotide-binding domain 1261 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 1262 Cyclic nucleotide-binding domain 1262 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 1263 POLO box duplicated region 1264 Calponin family repeat 1265 Calponin family repeat 1266 Calponin family repeat 1268 7 transmembrane receptor (rhodopsin family) 1268 7 transmembrane receptor (rhodopsin family) 1268 7 transmembrane receptor (rhodopsin family) 1269 7 transmembrane receptor (rhodopsin family) 1270 Ciliary neurotrophic factor 1272 Fibronectin type III domain 1272 Fibronectin type III domain 1277 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 1277 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 1278 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1280 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 1281 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 1282 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1284 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1285 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1285 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1285 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1285 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1285 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1286 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1287 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1287 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1287 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1288 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1288 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 1290 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 1291 von Willebrand factor type A domain 1291 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1292 von Willebrand factor type A domain 1292 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1292 von Willebrand factor type A domain 1292 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1292 von Willebrand factor type A domain 1292 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1293 von Willebrand factor type A domain 1293 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1293 von Willebrand factor type A domain 1293 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1293 von Willebrand factor type A domain 1293 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1293 von Willebrand factor type A domain 1293 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1293 von Willebrand factor type A domain 1293 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1294 Fibronectin type III domain 1294 von Willebrand factor type A domain 1294 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1295 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 1295 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 1296 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 1296 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 1297 Thrombospondin N-terminal -like domain 1297 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1297 Thrombospondin N-terminal -like domain 1297 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1300 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 1301 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 1301 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1301 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 1301 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1301 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 1301 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1303 Fibronectin type III domain 1303 von Willebrand factor type A domain 1303 Thrombospondin N-terminal -like domain 1303 Fibronectin type III domain 1303 von Willebrand factor type A domain 1303 Thrombospondin N-terminal -like domain 1306 Thrombospondin N-terminal -like domain 1307 Thrombospondin N-terminal -like domain 1310 Thrombospondin N-terminal -like domain 1310 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 1314 Coatomer WD associated domain 1315 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 1317 Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi 1318 Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi 1325 Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som 1327 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 1329 Cytochrome c oxidase subunit Vb 1337 Cytochrome c oxidase subunit VIa 1339 Cytochrome c oxidase subunit VIa 1340 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 1345 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 1346 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 1347 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 1350 pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast me 1351 Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII 1356 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 1357 Zinc carboxypeptidase 1358 Zinc carboxypeptidase 1358 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 1359 Zinc carboxypeptidase 1359 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 1360 Zinc carboxypeptidase 1360 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 1361 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 1361 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 1363 Zinc carboxypeptidase 1364 PMP-22/EMP/MP20/Claudin family 1365 PMP-22/EMP/MP20/Claudin family 1366 PMP-22/EMP/MP20/Claudin family 1368 Zinc carboxypeptidase 1369 Zinc carboxypeptidase 1371 Coproporphyrinogen III oxidase 1373 MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also 1374 Choline/Carnitine o-acyltransferase 1375 Choline/Carnitine o-acyltransferase 1375 Choline/Carnitine o-acyltransferase 1375 Choline/Carnitine o-acyltransferase 1375 Choline/Carnitine o-acyltransferase 1376 Choline/Carnitine o-acyltransferase 1378 Sushi domain (SCR repeat) 1378 Sushi domain (SCR repeat) 1379 Sushi domain (SCR repeat) 1380 Sushi domain (SCR repeat) 1381 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 1382 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 1384 Choline/Carnitine o-acyltransferase 1384 Choline/Carnitine o-acyltransferase 1384 Choline/Carnitine o-acyltransferase 1385 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 1385 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 1385 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 1385 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 1387 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1387 TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1 1387 KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun 1390 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 1392 Corticotropin-releasing factor family 1394 7 transmembrane receptor (Secretin family) 1395 7 transmembrane receptor (Secretin family) 1396 LIM domain. This family represents two copies of the LIM structural domain 1397 LIM domain. This family represents two copies of the LIM structural domain 1398 SH2 domain 1398 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 1398 SH2 domain 1398 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 1399 SH2 domain 1399 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 1400 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 1401 Pentaxin family. Pentaxins are also known as pentraxins 1404 Extracellular link domain 1406 Homeobox domain 1407 FAD binding domain of DNA photolyase 1408 FAD binding domain of DNA photolyase 1409 Hsp20/alpha crystallin family 1409 Alpha crystallin A chain, N terminal 1410 Hsp20/alpha crystallin family 1411 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1412 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1412 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1412 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1413 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1414 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1415 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1417 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1418 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1419 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1420 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1421 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1427 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 1428 Ornithine cyclodeaminase/mu-crystallin family. This family contains the bacterial Ornithine cyclodeaminase enzyme EC:4.3.1.12, which catalyses the deamination of ornithine to proline. This family also contains mu-Crystallin the major component of the eye 1431 Citrate synthase 1434 CAS/CSE protein, C-terminus. Mammalian cellular apoptosis susceptibility (CAS) proteins are homologous to the yeast chromosome-segregation protein, CSE1. This family aligns the C-terminal halves (approximately). CAS is involved in both cellular apoptosis 1437 Granulocyte-macrophage colony-stimulating factor 1439 Fibronectin type III domain 1440 Interleukin-6/G-CSF/MGF family 1440 Interleukin-6/G-CSF/MGF family 1440 Interleukin-6/G-CSF/MGF family 1441 Fibronectin type III domain 1441 Fibronectin type III domain 1441 Fibronectin type III domain 1441 Fibronectin type III domain 1442 Somatotropin hormone family 1442 Somatotropin hormone family 1442 Somatotropin hormone family 1443 Somatotropin hormone family 1443 Somatotropin hormone family 1443 Somatotropin hormone family 1444 Somatotropin hormone family 1444 Somatotropin hormone family 1444 Somatotropin hormone family 1448 Kappa casein. Kappa-casein is a mammalian milk protein involved in a number of important physiological processes. In the gut, the ingested protein is split into an insoluble peptide (para kappa-casein) and a soluble hydrophilic glycopeptide (caseinomacro 1452 Protein kinase domain 1453 Protein kinase domain 1453 Protein kinase domain 1454 Protein kinase domain 1454 Protein kinase domain 1455 Protein kinase domain 1456 Protein kinase domain 1457 Protein kinase domain 1457 Protein kinase domain 1459 Protein kinase domain 1460 Casein kinase II regulatory subunit 1462 Extracellular link domain 1462 Sushi domain (SCR repeat) 1462 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 1463 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 1465 LIM domain. This family represents two copies of the LIM structural domain 1466 LIM domain. This family represents two copies of the LIM structural domain 1468 Mitochondrial carrier protein 1475 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 1476 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 1478 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 1482 Homeobox domain 1490 Thrombospondin type 1 domain 1491 Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OA 1491 Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OA 1495 Vinculin family 1496 Vinculin family 1497 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How 1499 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 1504 Trypsin 1506 Trypsin 1508 Papain family cysteine protease 1508 Papain family cysteine protease 1508 Papain family cysteine protease 1508 Papain family cysteine protease 1508 Papain family cysteine protease 1509 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 1510 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 1510 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 1512 Papain family cysteine protease 1512 Papain family cysteine protease 1515 Papain family cysteine protease 1522 Papain family cysteine protease 1523 Homeobox domain 1523 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 1524 7 transmembrane receptor (rhodopsin family) 1528 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 1534 Cytochrome b561 1536 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 1537 Cytochrome C1 family 1540 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 1543 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1544 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1545 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1548 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1549 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1549 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1551 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1553 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1555 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1557 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1558 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1558 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1559 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1562 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1565 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1571 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1572 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1573 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1576 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1577 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1579 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1580 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1581 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1582 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1583 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1584 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1585 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1586 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1588 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1588 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1589 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1591 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1592 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1592 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1593 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1594 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1595 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 1600 Phosphotyrosine interaction domain (PTB/PID) 1601 Phosphotyrosine interaction domain (PTB/PID) 1602 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 1602 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 1602 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 1603 DAD family. Members of this family are thought to be integral membrane proteins. Some members of this family have been shown to cause apoptosis if mutated, these proteins are known as DAD for defender against death. The family also includes the epsilon s 1604 Sushi domain (SCR repeat) 1606 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 1606 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 1607 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 1607 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 1607 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 1607 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 1608 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 1608 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 1609 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 1609 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 1610 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 1616 Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined 1617 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 1621 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 1621 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 1622 Acyl CoA binding protein 1627 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 1627 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 1630 Fibronectin type III domain 1632 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 1633 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 1634 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 1634 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 1634 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 1636 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 1636 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 1636 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 1638 Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372 1639 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 1642 CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p 1644 Pyridoxal-dependent decarboxylase conserved domain 1645 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 1646 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 1647 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 1650 Dolichyl-diphosphooligosaccharide-protein glycosyltransferase 48kD subunit. Members of this family are involved in asparagine-linked protein glycosylation. In particular, dolichyl-diphosphooligosaccharide-protein glycosyltransferase (DDOST), also known a 1652 Macrophage migration inhibitory factor (MIF) 1667 Mammalian defensin 1667 Defensin propeptide 1668 Mammalian defensin 1668 Defensin propeptide 1669 Mammalian defensin 1669 Defensin propeptide 1670 Mammalian defensin 1670 Defensin propeptide 1671 Mammalian defensin 1671 Defensin propeptide 1674 Intermediate filament protein 1676 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 1677 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 1678 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 1690 LCCL domain 1690 von Willebrand factor type A domain 1716 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 1716 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 1717 Ergosterol biosynthesis ERG4/ERG24 family 1718 FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD bi 1719 Dihydrofolate reductase 1723 Dihydroorotate dehydrogenase 1725 Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine 1725 Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine 1725 Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine 1727 Oxidoreductase FAD-binding domain 1727 Oxidoreductase FAD-binding domain 1728 Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones 1731 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 1733 Iodothyronine deiodinase 1734 Iodothyronine deiodinase 1734 Iodothyronine deiodinase 1735 Iodothyronine deiodinase 1736 PUA domain 1736 TruB family pseudouridylate synthase (N terminal domain) 1737 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 1743 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 1743 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 1745 Homeobox domain 1746 Homeobox domain 1747 Homeobox domain 1748 Homeobox domain 1748 Homeobox domain 1749 Homeobox domain 1750 Homeobox domain 1755 CUB domain 1755 Zona pellucida-like domain 1755 Zona pellucida-like domain 1755 Zona pellucida-like domain 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1756 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 1756 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1759 Dynamin GTPase effector domain 1759 PH domain. PH stands for pleckstrin homology 1759 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 1762 WD domain, G-beta repeat 1767 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 1769 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 1770 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 1770 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 1777 Deoxyribonuclease II 1778 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 1785 Dynamin family 1785 Dynamin GTPase effector domain 1785 PH domain. PH stands for pleckstrin homology 1786 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 1786 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 1788 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 1788 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 1788 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 1789 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 1789 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 1789 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 1789 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 1791 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 1794 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 1796 PTB domain (IRS-1 type) 1798 Glycosyl transferase 1800 Renal dipeptidase 1801 Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein s 1802 Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein s 1803 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 1804 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 1804 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 1808 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 1809 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 1811 Sulfate transporter family. Mutations may lead to several human diseases 1811 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 1812 7 transmembrane receptor (rhodopsin family) 1813 7 transmembrane receptor (rhodopsin family) 1813 7 transmembrane receptor (rhodopsin family) 1814 7 transmembrane receptor (rhodopsin family) 1814 7 transmembrane receptor (rhodopsin family) 1814 7 transmembrane receptor (rhodopsin family) 1814 7 transmembrane receptor (rhodopsin family) 1814 7 transmembrane receptor (rhodopsin family) 1815 7 transmembrane receptor (rhodopsin family) 1816 7 transmembrane receptor (rhodopsin family) 1820 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 1821 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 1822 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 1823 Cadherin domain 1823 Cadherin domain 1824 Cadherin domain 1824 Cadherin domain 1824 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1825 Cadherin domain 1825 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1825 Cadherin domain 1826 Fibronectin type III domain 1828 Cadherin domain 1828 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 1829 Cadherin domain 1830 Cadherin domain 1831 TSC-22/dip/bun family 1832 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 1833 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 1836 Sulfate transporter family. Mutations may lead to several human diseases 1840 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 1841 Thymidylate kinase 1842 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 1843 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1844 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1844 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 1845 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1846 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1846 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1847 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1848 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1848 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1849 Protein-tyrosine phosphatase 1849 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1849 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 1850 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1852 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 1854 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 1855 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 1855 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 1855 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 1855 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 1855 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 1855 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 1855 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 1855 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 1855 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The D 1856 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 1856 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 1856 Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778) 1856 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 1857 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 1857 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 1857 Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778) 1857 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 1869 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 1870 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 1871 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 1874 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 1875 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 1876 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 1879 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 1880 7 transmembrane receptor (rhodopsin family) 1894 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 1901 7 transmembrane receptor (rhodopsin family) 1902 7 transmembrane receptor (rhodopsin family) 1902 7 transmembrane receptor (rhodopsin family) 1903 7 transmembrane receptor (rhodopsin family) 1906 Endothelin family 1907 Endothelin family 1908 Endothelin family 1909 7 transmembrane receptor (rhodopsin family) 1910 7 transmembrane receptor (rhodopsin family) 1910 7 transmembrane receptor (rhodopsin family) 1911 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 1912 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 1937 Elongation factor 1 gamma, conserved domain 1942 Ephrin 1942 Ephrin 1943 Ephrin 1944 Ephrin 1945 Ephrin 1945 Ephrin 1945 Ephrin 1946 Ephrin 1947 Ephrin 1948 Ephrin 1949 Ephrin 1951 7 transmembrane receptor (Secretin family) 1951 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 1951 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 1951 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 1952 7 transmembrane receptor (Secretin family) 1952 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 1952 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 1956 Furin-like cysteine rich region 1956 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 1962 3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin 1962 3-hydroxyacyl-CoA dehydrogenase, C-terminal domain. This family also includes lambda crystallin. Some proteins include two copies of this domain 1962 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 1964 Eukaryotic initiation factor 1A 1967 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 1969 Fibronectin type III domain 1969 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 1977 Eukaryotic initiation factor 4E 1982 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 1983 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 1991 Trypsin 1992 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 2002 Ets-domain 2004 Ets-domain 2005 Ets-domain 2005 Ets-domain 2009 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes 2010 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 2011 Kinase associated domain 1 2011 Kinase associated domain 1 2012 PMP-22/EMP/MP20/Claudin family 2013 PMP-22/EMP/MP20/Claudin family 2014 PMP-22/EMP/MP20/Claudin family 2015 7 transmembrane receptor (Secretin family) 2015 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 2017 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 2017 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 2018 Homeobox domain 2019 Homeobox domain 2020 Homeobox domain 2023 Enolase, C-terminal TIM barrel domain 2030 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 2033 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 2033 TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1 2033 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 2033 KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun 2035 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 2036 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 2036 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 2037 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 2038 Transglutaminase family 2038 Transglutaminase family, C-terminal ig like domain 2038 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 2040 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 2041 Fibronectin type III domain 2041 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2041 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 2042 Fibronectin type III domain 2042 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2042 Fibronectin type III domain 2042 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2043 Fibronectin type III domain 2043 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2044 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2044 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2045 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2046 Fibronectin type III domain 2046 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2047 Fibronectin type III domain 2047 Giardia variant-specific surface protein 2047 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2048 Fibronectin type III domain 2048 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2048 Fibronectin type III domain 2048 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2049 Fibronectin type III domain 2049 Giardia variant-specific surface protein 2049 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2050 Fibronectin type III domain 2050 Giardia variant-specific surface protein 2050 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2051 Fibronectin type III domain 2051 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 2051 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 2052 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 2053 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 2056 Erythropoietin/thrombopoietin 2058 WHEP-TRS domain 2058 tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyl 2063 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2063 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2064 Furin-like cysteine rich region 2064 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 2065 Furin-like cysteine rich region 2065 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 2066 Furin-like cysteine rich region 2066 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 2070 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2070 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2070 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2070 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2073 XPG I-region 2073 XPG N-terminal domain 2074 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 2077 Ets-domain 2078 Ets-domain 2078 Sterile alpha motif (SAM)/Pointed domain 2079 Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene 2091 Fibrillarin 2098 Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family 2099 Oestrogen receptor 2099 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2099 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2100 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2100 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2101 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2103 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2103 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2104 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2104 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2109 Electron transfer flavoprotein beta subunit. This protein is distantly related to and forms a heterodimer with pfam00766 2113 Ets-domain 2113 Sterile alpha motif (SAM)/Pointed domain 2114 Ets-domain 2115 Ets-domain 2117 Ets-domain 2118 Ets-domain 2119 Ets-domain 2120 Ets-domain 2122 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 2128 Homeobox domain 2131 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 2132 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 2134 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 2137 Uncharacterized ACR, COG1579 2137 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 2138 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2138 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2138 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2138 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2139 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2139 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2139 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2139 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2139 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2140 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 2145 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 2146 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 2146 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 2147 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 2149 7 transmembrane receptor (rhodopsin family) 2150 7 transmembrane receptor (rhodopsin family) 2151 7 transmembrane receptor (rhodopsin family) 2152 Tissue factor 2153 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 2153 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 2155 Trypsin 2155 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 2155 Trypsin 2155 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 2157 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 2157 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 2157 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 2158 Trypsin 2158 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 2159 Trypsin 2159 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 2159 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 2160 Trypsin 2161 Trypsin 2161 Fibronectin type I domain 2161 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 2161 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 2162 Transglutaminase family 2162 Transglutaminase family, C-terminal ig like domain 2162 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 2165 Sushi domain (SCR repeat) 2166 Amidase 2167 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 2168 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 2169 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 2170 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 2171 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 2172 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 2173 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 2175 Fanconi anaemia group A protein 2176 Fanconi anaemia group C protein 2184 Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the ty 2185 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 2185 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 2185 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 2185 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 2186 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 2186 DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors) 2186 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 2186 DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors) 2186 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 2186 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 2191 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 2194 Zinc-binding dehydrogenase 2194 Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid th 2195 Cadherin domain 2195 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 2196 Cadherin domain 2196 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 2196 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 2200 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 2201 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 2203 Fructose-1-6-bisphosphatase 2206 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 2208 Lectin C-type domain. This family includes both long and short form C-type 2217 Class I Histocompatibility antigen, domains alpha 1 and 2 2222 Squalene/phytoene synthase 2224 Polyprenyl synthetase 2235 Ferrochelatase 2239 Glypican 2241 SH2 domain 2241 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 2242 SH2 domain 2242 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 2244 Fibrinogen beta and gamma chains, C-terminal globular domain 2245 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 2245 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 2246 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2246 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2246 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2247 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2248 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2249 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2250 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2250 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2251 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2252 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2253 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2253 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2253 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2253 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2254 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 2262 Glypican 2266 Fibrinogen beta and gamma chains, C-terminal globular domain 2266 Fibrinogen beta and gamma chains, C-terminal globular domain 2268 SH2 domain 2268 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 2272 HIT family 2274 LIM domain. This family represents two copies of the LIM structural domain 2275 LIM domain. This family represents two copies of the LIM structural domain 2281 FKBP-type peptidyl-prolyl cis-trans isomerase 2286 FKBP-type peptidyl-prolyl cis-trans isomerase 2286 FKBP-type peptidyl-prolyl cis-trans isomerase 2287 FKBP-type peptidyl-prolyl cis-trans isomerase 2288 FKBP-type peptidyl-prolyl cis-trans isomerase 2289 FKBP-type peptidyl-prolyl cis-trans isomerase 2290 Fork head domain 2296 Fork head domain 2297 Fork head domain 2301 Fork head domain 2302 Fork head domain 2303 Fork head domain 2306 Fork head domain 2308 Fork head domain 2313 Ets-domain 2316 Filamin/ABP280 repeat 2317 Filamin/ABP280 repeat 2318 Filamin/ABP280 repeat 2319 Flotillin family. Flotillins are integral membrane proteins that have been shown to be present in several subcellular components, including caveolae (invaginated plasma membrane microdomains), lipid rafts (sphingolipid and cholesterol-rich, detergent-res 2322 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 2323 pfam02947, flt3_lig, flt3 ligand. The flt3 ligand is a short chain cytokine with a 4 helical bundle fold 2326 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 2327 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 2328 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 2329 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 2330 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 2335 Fibronectin type I domain 2335 Fibronectin type III domain 2335 Fibronectin type I domain 2346 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 2346 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 2348 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 2348 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 2348 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 2348 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 2348 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 2348 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 2350 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 2352 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 2357 7 transmembrane receptor (rhodopsin family) 2358 7 transmembrane receptor (rhodopsin family) 2359 7 transmembrane receptor (rhodopsin family) 2395 Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain 2395 Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain 2475 Phosphatidylinositol 3- and 4-kinase 2475 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 2487 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 2492 7 transmembrane receptor (rhodopsin family) 2492 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 2492 7 transmembrane receptor (rhodopsin family) 2492 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 2494 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2494 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2495 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 2512 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 2515 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 2515 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 2516 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2516 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2517 Alpha-L-fucosidase 2519 Alpha-L-fucosidase 2520 Gastrin/cholecystokinin family 2521 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 2523 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 2524 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 2525 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 2526 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 2527 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 2528 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 2529 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 2531 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 2534 SH2 domain 2534 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 2534 SH2 domain 2534 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 2534 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 2535 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 2547 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 2548 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 2550 7 transmembrane receptor (metabotropic glutamate family) 2550 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2550 7 transmembrane receptor (metabotropic glutamate family) 2550 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2550 7 transmembrane receptor (metabotropic glutamate family) 2550 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2550 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2551 Ets-domain 2551 Sterile alpha motif (SAM)/Pointed domain 2554 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2554 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2555 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2555 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2556 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2556 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2557 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2557 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2558 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2558 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2559 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2559 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2560 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2560 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2561 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2561 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2561 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2561 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2562 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2562 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2562 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2562 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2563 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2563 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2564 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2564 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2564 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2564 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2564 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2564 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2564 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2564 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2565 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2565 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2566 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2566 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2567 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2567 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2568 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2568 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2569 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2569 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2570 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2570 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2571 Pyridoxal-dependent decarboxylase conserved domain 2571 Pyridoxal-dependent decarboxylase conserved domain 2572 Pyridoxal-dependent decarboxylase conserved domain 2580 Protein-tyrosine phosphatase 2581 Glycosyl hydrolase family 59 2583 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 2587 7 transmembrane receptor (rhodopsin family) 2588 Sulfatase 2589 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 2590 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 2591 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 2592 Galactose-1-phosphate uridyl transferase, N-terminal domain. SCOP reports fold duplication with C-terminal domain. Both involved in Zn and Fe binding 2592 Galactose-1-phosphate uridyl transferase, N-terminal domain. SCOP reports fold duplication with C-terminal domain. Both involved in Zn and Fe binding 2617 WHEP-TRS domain 2624 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d 2629 O-Glycosyl hydrolase family 30 2631 ssDNA binding protein 2633 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 2633 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 2634 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 2634 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 2635 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 2635 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 2637 Homeobox domain 2641 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 2642 7 transmembrane receptor (Secretin family) 2642 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 2645 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 2645 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 2645 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 2649 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2649 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2649 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2649 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2649 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2649 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2650 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 2651 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 2651 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 2651 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 2652 7 transmembrane receptor (rhodopsin family) 2653 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl grou 2658 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 2660 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 2664 GDP dissociation inhibitor 2665 GDP dissociation inhibitor 2668 Transforming growth factor beta like domain 2670 Intermediate filament protein 2674 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 2674 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 2675 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 2676 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 2678 Gamma-glutamyltranspeptidase 2678 Gamma-glutamyltranspeptidase 2678 Gamma-glutamyltranspeptidase 2679 Gamma-glutamyltranspeptidase 2683 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 2687 Gamma-glutamyltranspeptidase 2688 Somatotropin hormone family 2688 Somatotropin hormone family 2688 Somatotropin hormone family 2688 Somatotropin hormone family 2689 Somatotropin hormone family 2689 Somatotropin hormone family 2689 Somatotropin hormone family 2689 Somatotropin hormone family 2690 Fibronectin type III domain 2691 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 2692 7 transmembrane receptor (Secretin family) 2693 7 transmembrane receptor (rhodopsin family) 2694 Eukaryotic cobalamin-binding protein 2695 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 2696 7 transmembrane receptor (Secretin family) 2697 Connexin 2697 Gap junction alpha-1 protein (Cx43) 2700 Connexin 2701 Connexin 2702 Connexin 2702 Connexin 2703 Connexin 2703 Gap junction alpha-8 protein (Cx50) 2705 Connexin 2706 Connexin 2707 Connexin 2709 Connexin 2717 Melibiase 2719 Glypican 2720 Glycosyl hydrolases family 35 2729 Glutamate-cysteine ligase. This family represents the catalytic subunit of glutamate-cysteine ligase (E.C. 6.3.2.2), also known as gamma-glutamylcysteine synthetase (GCS). This enzyme catalyses the rate limiting step in the biosynthesis of glutathione. T 2734 Cysteine rich repeat. This cysteine rich repeat contains four cysteines. It is found in multiple copies in a protein that binds to fibroblast growth factors. The repeat is also found in MG160 and E-selectin ligand (ESL-1) 2739 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 2740 7 transmembrane receptor (Secretin family) 2741 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2741 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2742 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2742 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2743 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 2743 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 2745 Glutaredoxin 2747 Glu/Leu/Phe/Val dehydrogenase, dimerisation domain 2747 Glutamate/Leucine/Phenylalanine/Valine dehydrogenase 2752 Glutamine synthetase, catalytic domain 2764 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 2766 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 2767 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2768 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2769 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2770 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2771 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2773 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2774 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2775 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2776 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2778 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2778 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2778 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2779 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2779 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2780 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2781 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 2794 GTPase of unknown function 2798 7 transmembrane receptor (rhodopsin family) 2799 Sulfatase 2800 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 2803 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 2810 14-3-3 protein 2810 14-3-3 protein 2813 Zona pellucida-like domain 2815 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 2817 Glypican 2819 NAD-dependent glycerol-3-phosphate dehydrogenase 2821 Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate 2823 Myelin proteolipid protein (PLP or lipophilin) 2824 Myelin proteolipid protein (PLP or lipophilin) 2825 7 transmembrane receptor (rhodopsin family) 2826 7 transmembrane receptor (rhodopsin family) 2827 7 transmembrane receptor (rhodopsin family) 2828 7 transmembrane receptor (rhodopsin family) 2829 7 transmembrane receptor (rhodopsin family) 2830 7 transmembrane receptor (rhodopsin family) 2831 7 transmembrane receptor (rhodopsin family) 2832 7 transmembrane receptor (rhodopsin family) 2833 7 transmembrane receptor (rhodopsin family) 2834 7 transmembrane receptor (rhodopsin family) 2835 7 transmembrane receptor (rhodopsin family) 2837 7 transmembrane receptor (rhodopsin family) 2838 7 transmembrane receptor (rhodopsin family) 2840 7 transmembrane receptor (rhodopsin family) 2841 7 transmembrane receptor (rhodopsin family) 2842 7 transmembrane receptor (rhodopsin family) 2843 7 transmembrane receptor (rhodopsin family) 2844 7 transmembrane receptor (rhodopsin family) 2845 7 transmembrane receptor (rhodopsin family) 2846 7 transmembrane receptor (rhodopsin family) 2847 7 transmembrane receptor (rhodopsin family) 2848 7 transmembrane receptor (rhodopsin family) 2849 7 transmembrane receptor (rhodopsin family) 2850 7 transmembrane receptor (rhodopsin family) 2852 7 transmembrane receptor (rhodopsin family) 2853 7 transmembrane receptor (rhodopsin family) 2854 7 transmembrane receptor (rhodopsin family) 2857 7 transmembrane receptor (rhodopsin family) 2859 7 transmembrane receptor (rhodopsin family) 2861 7 transmembrane receptor (rhodopsin family) 2862 7 transmembrane receptor (rhodopsin family) 2863 7 transmembrane receptor (rhodopsin family) 2864 7 transmembrane receptor (rhodopsin family) 2865 7 transmembrane receptor (rhodopsin family) 2866 7 transmembrane receptor (rhodopsin family) 2867 7 transmembrane receptor (rhodopsin family) 2868 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 2869 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 2870 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 2876 Glutathione peroxidase 2877 Glutathione peroxidase 2878 Glutathione peroxidase 2880 Glutathione peroxidase 2880 Glutathione peroxidase 2885 SH2 domain 2885 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 2890 Bacterial extracellular solute-binding proteins, family 3 2890 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 2890 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2891 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2892 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2892 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2893 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2894 Bacterial extracellular solute-binding proteins, family 3 2894 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2895 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2896 Granulin 2897 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2897 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2898 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2898 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2899 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2900 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2901 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2902 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2902 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2902 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2903 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2904 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2905 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2906 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2907 Uncharacterized protein family UPF0005 2908 Glucocorticoid receptor 2908 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 2908 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 2909 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 2909 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 2911 7 transmembrane receptor (metabotropic glutamate family) 2911 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2912 7 transmembrane receptor (metabotropic glutamate family) 2912 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2913 7 transmembrane receptor (metabotropic glutamate family) 2913 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2914 7 transmembrane receptor (metabotropic glutamate family) 2914 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2915 7 transmembrane receptor (metabotropic glutamate family) 2915 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2916 7 transmembrane receptor (metabotropic glutamate family) 2916 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2917 7 transmembrane receptor (metabotropic glutamate family) 2917 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2917 7 transmembrane receptor (metabotropic glutamate family) 2917 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2917 7 transmembrane receptor (metabotropic glutamate family) 2917 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2918 7 transmembrane receptor (metabotropic glutamate family) 2918 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2919 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 2920 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 2921 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 2923 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 2925 7 transmembrane receptor (rhodopsin family) 2926 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 2928 Homeobox domain 2937 Eukaryotic glutathione synthase 2937 Eukaryotic glutathione synthase, ATP binding domain 2938 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2939 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2940 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2941 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2944 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2944 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2946 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2947 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2948 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2948 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2948 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2949 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2950 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 2956 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 2958 Transcription initiation factor IIA, gamma subunit, helical domain 2958 Transcription initiation factor IIA, gamma subunit, beta-barrel domain 2960 TFIIE alpha subunit 2961 TFIIE beta subunit core domain. General transcription factor TFIIE consists of two subunits, TFIIE alpha pfam02002 and TFIIE beta. TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription 2963 Transcription initiation factor IIF 2966 Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex 2967 Transcription factor Tfb4 2968 Transcription factor Tfb2 2969 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 2969 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 2969 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 2969 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 2969 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 2977 Adenylate and Guanylate cyclase catalytic domain 2980 Guanylin precursor 2981 Guanylin precursor 2983 Adenylate and Guanylate cyclase catalytic domain 2984 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2986 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 2990 Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 2990 Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold 2993 Glycophorin A 2994 Glycophorin A 2997 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 2998 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 3000 Protein kinase domain 3000 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 3001 Trypsin 3002 Trypsin 3003 Trypsin 3005 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 3006 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 3007 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 3008 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 3009 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 3010 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 3024 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 3028 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3029 Metallo-beta-lactamase superfamily 3030 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 3032 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 3034 Phenylalanine and histidine ammonia-lyase 3039 Globin 3040 Globin 3043 Globin 3045 Globin 3046 Globin 3047 Globin 3048 Globin 3049 Globin 3050 Globin 3052 Cytochrome c/c1 heme lyase 3053 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 3055 SH2 domain 3055 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 3059 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 3060 Prepro-orexin 3061 7 transmembrane receptor (rhodopsin family) 3062 Orexin receptor type 2 3062 7 transmembrane receptor (rhodopsin family) 3064 Huntingtin 3065 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 3066 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 3067 Pyridoxal-dependent decarboxylase conserved domain 3068 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 3069 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 3073 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold 3074 Glycosyl hydrolase family 20, domain 2. This domain has a zincin-like fold 3074 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold 3075 Sushi domain (SCR repeat) 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3077 Class I Histocompatibility antigen, domains alpha 1 and 2 3078 Sushi domain (SCR repeat) 3080 Sushi domain (SCR repeat) 3081 homogentisate 1,2-dioxygenase. Homogentisate dioxygenase cleaves the aromatic ring during the metabolic degradation of Phe and Tyr. Homogentisate dioxygenase deficiency causes alkaptonuria. The structure of homogentisate dioxygenase shows that the enzyme 3082 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 3083 Trypsin 3083 Fibronectin type I domain 3083 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 3084 Neuregulin family 3084 Neuregulin family 3084 Neuregulin family 3084 Neuregulin family 3084 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 3087 Homeobox domain 3090 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 3094 HIT family 3096 Zinc finger 3097 Zinc finger 3098 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 3098 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 3098 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 3098 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 3098 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 3099 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 3101 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 3101 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF03727. Some members of the family have two copies of each of these domains 3104 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 3105 Class I Histocompatibility antigen, domains alpha 1 and 2 3106 Class I Histocompatibility antigen, domains alpha 1 and 2 3107 Class I Histocompatibility antigen, domains alpha 1 and 2 3108 Class II histocompatibility antigen, alpha domain 3109 Class II histocompatibility antigen, beta domain 3110 Homeobox domain 3111 Class II histocompatibility antigen, alpha domain 3112 Class II histocompatibility antigen, beta domain 3113 Class II histocompatibility antigen, alpha domain 3115 Class II histocompatibility antigen, beta domain 3117 Class II histocompatibility antigen, alpha domain 3118 Class II histocompatibility antigen, alpha domain 3119 Class II histocompatibility antigen, beta domain 3120 Class II histocompatibility antigen, beta domain 3120 Class II histocompatibility antigen, beta domain 3120 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 3122 Class II histocompatibility antigen, alpha domain 3123 Class II histocompatibility antigen, beta domain 3125 Class II histocompatibility antigen, beta domain 3126 Class II histocompatibility antigen, beta domain 3127 Class II histocompatibility antigen, beta domain 3133 Class I Histocompatibility antigen, domains alpha 1 and 2 3134 Class I Histocompatibility antigen, domains alpha 1 and 2 3135 Class I Histocompatibility antigen, domains alpha 1 and 2 3140 Class I Histocompatibility antigen, domains alpha 1 and 2 3142 Homeobox domain 3148 HMG (high mobility group) box 3149 HMG (high mobility group) box 3155 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 3156 Hydroxymethylglutaryl-coenzyme A reductase 3156 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 3157 Hydroxymethylglutaryl-coenzyme A synthase 3158 Hydroxymethylglutaryl-coenzyme A synthase 3162 Heme oxygenase 3163 Heme oxygenase 3164 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 3164 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 3164 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 3164 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 3164 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 3166 Homeobox domain 3169 Fork head domain 3170 Fork head domain 3170 Fork head domain 3171 Fork head domain 3172 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 3172 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 3172 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 3172 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 3172 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 3172 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 3174 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 3174 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 3175 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 3177 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 3187 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 3188 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 3190 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 3190 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 3190 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 3192 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 3192 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 3195 Homeobox domain 3196 Homeobox domain 3196 Homeobox domain 3198 Homeobox domain 3199 Homeobox domain 3200 Homeobox domain 3200 Homeobox domain 3200 Homeobox domain 3201 Homeobox domain 3202 Homeobox domain 3203 Homeobox domain 3204 Homeobox domain 3205 Homeobox domain 3205 Homeobox domain 3206 Homeobox domain 3206 Homeobox domain 3207 Homeobox domain 3209 Homeobox domain 3211 Homeobox domain 3212 Homeobox domain 3213 Homeobox domain 3214 Homeobox domain 3215 Homeobox domain 3216 Homeobox domain 3216 Homeobox domain 3217 Homeobox domain 3218 Homeobox domain 3219 Homeobox domain 3221 Homeobox domain 3221 Homeobox domain 3222 Homeobox domain 3223 Homeobox domain 3223 Homeobox domain 3224 Homeobox domain 3225 Homeobox domain 3226 Homeobox domain 3227 Homeobox domain 3228 Homeobox domain 3229 Homeobox domain 3231 Homeobox domain 3232 Homeobox domain 3233 Homeobox domain 3234 Homeobox domain 3235 Homeobox domain 3236 Homeobox domain 3237 Homeobox domain 3238 Homeobox domain 3239 Homeobox domain 3240 Trypsin 3248 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3249 Trypsin 3263 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 3266 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 3269 7 transmembrane receptor (rhodopsin family) 3273 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 3274 7 transmembrane receptor (rhodopsin family) 3280 Helix-loop-helix DNA-binding domain 3283 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 3284 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 3290 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3290 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3291 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3292 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3293 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3294 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3295 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 3295 MaoC like domain. The MaoC protein is found to share similarity with a wide variety of enzymes 3295 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 3297 HSF-type DNA-binding 3298 HSF-type DNA-binding 3299 HSF-type DNA-binding 3303 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3304 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3305 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3306 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3308 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 3309 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3310 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3312 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3312 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3313 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 3315 Hsp20/alpha crystallin family 3316 Hsp20/alpha crystallin family 3320 Hsp90 protein 3324 Hsp90 protein 3324 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 3326 Hsp90 protein 3336 Chaperonin 10 Kd subunit 3339 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3340 Sulfotransferase protein 3344 Fork head domain 3350 7 transmembrane receptor (rhodopsin family) 3351 7 transmembrane receptor (rhodopsin family) 3352 7 transmembrane receptor (rhodopsin family) 3354 7 transmembrane receptor (rhodopsin family) 3355 7 transmembrane receptor (rhodopsin family) 3356 7 transmembrane receptor (rhodopsin family) 3357 7 transmembrane receptor (rhodopsin family) 3358 7 transmembrane receptor (rhodopsin family) 3359 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 3359 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 3360 7 transmembrane receptor (rhodopsin family) 3361 7 transmembrane receptor (rhodopsin family) 3362 7 transmembrane receptor (rhodopsin family) 3363 7 transmembrane receptor (rhodopsin family) 3363 7 transmembrane receptor (rhodopsin family) 3363 7 transmembrane receptor (rhodopsin family) 3364 Hus1-like protein. Hus1, Rad1, and Rad9 are three evolutionarily conserved proteins required for checkpoint control in fission yeast. These proteins are known to form a stable complex in vivo. Hus1-Rad1-Rad9 complex may form a PCNA-like ring structure, a 3371 Fibronectin type III domain 3373 Hyaluronidase 3373 Hyaluronidase 3373 Hyaluronidase 3373 Hyaluronidase 3373 Hyaluronidase 3373 Hyaluronidase 3373 Hyaluronidase 3383 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 3384 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 3394 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 3396 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly con 3398 Helix-loop-helix DNA-binding domain 3400 Helix-loop-helix DNA-binding domain 3423 Sulfatase 3423 Sulfatase 3425 Glycosyl hydrolases family 39 3426 Trypsin 3426 Low-density lipoprotein receptor domain class A 3426 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 3428 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 3428 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 3431 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 3431 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 3431 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 3431 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 3431 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 3431 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 3439 Interferon alpha/beta domain 3440 Interferon alpha/beta domain 3441 Interferon alpha/beta domain 3442 Interferon alpha/beta domain 3443 Interferon alpha/beta domain 3444 Interferon alpha/beta domain 3445 Interferon alpha/beta domain 3446 Interferon alpha/beta domain 3447 Interferon alpha/beta domain 3448 Interferon alpha/beta domain 3449 Interferon alpha/beta domain 3451 Interferon alpha/beta domain 3452 Interferon alpha/beta domain 3455 Tissue factor 3456 Interferon alpha/beta domain 3458 Interferon gamma 3460 Fibronectin type III domain 3467 Interferon alpha/beta domain 3476 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 3479 Insulin/IGF/Relaxin family 3480 Fibronectin type III domain 3480 Furin-like cysteine rich region 3480 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 3481 Insulin/IGF/Relaxin family 3482 Cation-independent mannose-6-phosphate receptor repeat. The cation-independent mannose-6-phosphate receptor contains 15 copies of a repeat 3483 Leucine rich repeat C-terminal domain 3484 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 3485 Thyroglobulin type-1 repeat 3485 Insulin-like growth factor binding proteins 3486 Thyroglobulin type-1 repeat 3486 Insulin-like growth factor binding protein 3487 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 3488 Thyroglobulin type-1 repeat 3488 Insulin-like growth factor binding proteins 3489 Thyroglobulin type-1 repeat 3489 Insulin-like growth factor binding proteins 3491 Thrombospondin type 1 domain 3491 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 3516 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 3549 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 3549 Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation 3551 Protein kinase domain 3553 Interleukin-1 propeptide. The Interleukin-1 cytokines are translated as precursor proteins. The N terminal approx. 115 amino acids form a propeptide that is cleaved off to release the active interleukin-1 3554 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 3556 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 3558 Interleukin 2 3559 Sushi domain (SCR repeat) 3562 Interleukin-3 3565 Interleukin 4 3565 Interleukin 4 3566 Fibronectin type III domain 3567 Interleukin 5 3569 Interleukin-6/G-CSF/MGF family 3572 Fibronectin type III domain 3572 Fibronectin type III domain 3575 Fibronectin type III domain 3576 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 3577 7 transmembrane receptor (rhodopsin family) 3578 Interleukin 7/9 family. IL-7 is a cytokine that acts as a growth factor for early lymphoid cells of both B- and T-cell lineages. IL-9 is a multi-functional cytokine that, although originally described as a T-cell growth factor, its function in T-cell res 3579 7 transmembrane receptor (rhodopsin family) 3586 Interleukin 10 3588 Tissue factor 3592 Interleukin-12 alpha subunit. Interleukin 12 (IL-12) is a disulphide-bonded heterodimer consisting of a 35kDa alpha subunit and a 40kDa beta subunit. It is involved in the stimulation and maintenance of Th1 cellular immune responses, including the normal 3594 Fibronectin type III domain 3595 Fibronectin type III domain 3600 Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses 3600 Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses 3600 Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses 3606 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 3607 Fork head domain 3607 Fork head domain 3609 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 3609 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 3609 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 3611 Protein kinase domain 3612 Inositol monophosphatase family 3613 Inositol monophosphatase family 3614 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 3615 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 3617 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 3619 Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregati 3620 Indoleamine 2,3-dioxygenase 3624 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 3625 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 3626 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 3627 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 3628 Inositol monophosphatase family 3630 Insulin/IGF/Relaxin family. Superfamily includes insulins 3643 Fibronectin type III domain 3643 Furin-like cysteine rich region 3643 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 3645 Fibronectin type III domain 3645 Furin-like cysteine rich region 3645 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 3645 Fibronectin type III domain 3645 Furin-like cysteine rich region 3645 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 3646 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 3651 Homeobox domain 3652 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 3654 Protein kinase domain 3656 Death domain 3656 Protein kinase domain 3658 Aconitase family (aconitate hydratase) 3658 Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism 3659 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 3660 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 3662 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 3667 PTB domain (IRS-1 type) 3670 LIM domain. This family represents two copies of the LIM structural domain 3671 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 3672 von Willebrand factor type A domain 3673 von Willebrand factor type A domain 3674 Integrin alpha cytoplasmic region. This family contains the short intracellular region of integrin alpha chains 3678 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 3681 von Willebrand factor type A domain 3682 von Willebrand factor type A domain 3683 von Willebrand factor type A domain 3684 von Willebrand factor type A domain 3684 Integrin alpha cytoplasmic region. This family contains the short intracellular region of integrin alpha chains 3688 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3688 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3688 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3688 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3688 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3688 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3689 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3690 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3691 Calx-beta domain 3691 Fibronectin type III domain 3691 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3692 eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues 3692 eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues 3693 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3694 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3695 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3696 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 3697 von Willebrand factor type A domain 3702 SH2 domain 3702 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 3702 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 3703 Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase 3706 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 3707 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 3708 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 3708 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3709 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 3709 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 3710 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 3710 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 3714 Delta serrate ligand 3714 Delta serrate ligand 3716 Protein kinase domain 3720 jmjC domain 3720 jmjN domain 3720 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 3720 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 3725 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 3725 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 3726 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 3726 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 3727 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 3727 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 3728 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 3728 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 3730 Fibronectin type III domain 3732 Tetraspanin family 3736 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3736 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3737 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3737 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3738 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3738 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3739 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3739 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3741 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3741 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3742 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3742 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3743 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3743 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3744 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3744 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3745 Kv2 voltage-gated K+ channel 3745 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3745 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3746 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3746 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3747 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3747 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3747 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3747 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3747 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3747 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3748 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3748 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3749 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3749 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3749 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3749 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3750 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3750 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3751 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3751 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3752 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3752 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3752 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3752 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3753 Slow voltage-gated potassium channel 3754 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3754 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3755 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3755 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3755 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3756 Cyclic nucleotide-binding domain 3756 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 3756 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3756 Cyclic nucleotide-binding domain 3756 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 3756 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3757 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 3757 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3757 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 3757 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3757 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3758 Inward rectifier potassium channel 3758 Inward rectifier potassium channel 3758 Inward rectifier potassium channel 3758 Inward rectifier potassium channel 3758 Inward rectifier potassium channel 3759 Inward rectifier potassium channel 3760 Inward rectifier potassium channel 3761 Inward rectifier potassium channel 3761 Inward rectifier potassium channel 3762 Inward rectifier potassium channel 3763 Inward rectifier potassium channel 3764 Inward rectifier potassium channel 3765 Inward rectifier potassium channel 3766 Inward rectifier potassium channel 3767 Inward rectifier potassium channel 3768 Inward rectifier potassium channel 3769 Inward rectifier potassium channel 3770 Inward rectifier potassium channel 3770 Inward rectifier potassium channel 3772 Inward rectifier potassium channel 3772 Inward rectifier potassium channel 3772 Inward rectifier potassium channel 3773 Inward rectifier potassium channel 3773 Inward rectifier potassium channel 3773 Inward rectifier potassium channel 3777 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3778 Calcium-activated BK potassium channel alpha subunit 3778 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3779 Calcium-activated potassium channel, beta subunit 3780 Calcium-activated SK potassium channel 3780 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 3781 Calcium-activated SK potassium channel 3781 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 3781 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 3782 Calcium-activated SK potassium channel 3782 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 3782 Calcium-activated SK potassium channel 3782 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 3783 Calcium-activated SK potassium channel 3783 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 3784 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 3784 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3784 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 3784 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3784 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 3784 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3785 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 3785 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3785 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 3785 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3785 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 3785 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3785 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 3785 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3785 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3786 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 3786 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3787 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3787 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3788 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3788 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 3790 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 3790 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 3795 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es 3795 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es 3797 Kinesin motor domain 3798 Kinesin motor domain 3799 Kinesin motor domain 3800 Kinesin motor domain 3801 Kinesin motor domain 3818 Trypsin 3827 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 3848 Intermediate filament protein 3849 Intermediate filament protein 3850 Intermediate filament protein 3851 Intermediate filament protein 3852 Intermediate filament protein 3853 Intermediate filament protein 3854 Intermediate filament protein 3855 Intermediate filament protein 3856 Intermediate filament protein 3857 Intermediate filament protein 3858 Intermediate filament protein 3859 Intermediate filament protein 3860 Intermediate filament protein 3860 Intermediate filament protein 3861 Intermediate filament protein 3866 Intermediate filament protein 3868 Intermediate filament protein 3872 Intermediate filament protein 3875 Intermediate filament protein 3880 Intermediate filament protein 3881 Intermediate filament protein 3882 Intermediate filament protein 3883 Intermediate filament protein 3884 Intermediate filament protein 3885 Intermediate filament protein 3886 Intermediate filament protein 3887 Intermediate filament protein 3888 Intermediate filament protein 3889 Intermediate filament protein 3890 Intermediate filament protein 3891 Intermediate filament protein 3892 Intermediate filament protein 3897 Fibronectin type III domain 3897 Fibronectin type III domain 3897 Fibronectin type III domain 3897 Fibronectin type III domain 3898 7 transmembrane receptor (rhodopsin family) 3906 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme 3908 Laminin G domain 3908 Laminin N-terminal (Domain VI) 3908 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3910 Giardia variant-specific surface protein 3911 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3912 Laminin N-terminal (Domain VI) 3912 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3913 Laminin N-terminal (Domain VI) 3913 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3914 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3915 Laminin B (Domain IV) 3915 Laminin N-terminal (Domain VI) 3915 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3916 Lysosome-associated membrane glycoprotein (Lamp) 3918 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3918 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 3918 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 3918 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 3920 Lysosome-associated membrane glycoprotein (Lamp) 3920 Lysosome-associated membrane glycoprotein (Lamp) 3921 Ribosomal protein S2 3925 Stathmin family 3927 LIM domain. This family represents two copies of the LIM structural domain 3929 LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 3930 Ergosterol biosynthesis ERG4/ERG24 family 3931 Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol 3932 SH2 domain 3933 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 3934 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 3936 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 3937 SH2 domain 3938 Glycosyl hydrolase family 1 3949 Low-density lipoprotein receptor domain class A 3949 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 3952 Leptin 3955 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 3956 Galactoside-binding lectin 3957 Galactoside-binding lectin 3958 Galactoside-binding lectin 3973 7 transmembrane receptor (rhodopsin family) 3975 Homeobox domain 3975 LIM domain. This family represents two copies of the LIM structural domain 3976 LIF / OSM family 3980 ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653 3980 Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc 3980 ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653 3980 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 3980 Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc 3981 ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653 3981 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 3982 PMP-22/EMP/MP20/Claudin family 3983 Villin headpiece domain 3983 LIM domain. This family represents two copies of the LIM structural domain 3983 Villin headpiece domain 3983 LIM domain. This family represents two copies of the LIM structural domain 3984 Protein kinase domain 3984 LIM domain. This family represents two copies of the LIM structural domain 3984 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 3984 LIM domain. This family represents two copies of the LIM structural domain 3984 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 3985 LIM domain. This family represents two copies of the LIM structural domain 3985 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 3985 LIM domain. This family represents two copies of the LIM structural domain 3985 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 3987 LIM domain. This family represents two copies of the LIM structural domain 3988 ab-hydrolase associated lipase region 3990 Lipase 3992 Fatty acid desaturase 3992 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 3995 Fatty acid desaturase 3995 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 3998 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19- 4000 Intermediate filament protein 4000 Intermediate filament tail domain 4000 Intermediate filament protein 4000 Intermediate filament tail domain 4000 Intermediate filament protein 4000 Intermediate filament tail domain 4001 Intermediate filament protein 4001 Intermediate filament tail domain 4004 LIM domain. This family represents two copies of the LIM structural domain 4005 LIM domain. This family represents two copies of the LIM structural domain 4007 LIM domain. This family represents two copies of the LIM structural domain 4008 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 4009 Homeobox domain 4009 LIM domain. This family represents two copies of the LIM structural domain 4010 Homeobox domain 4010 LIM domain. This family represents two copies of the LIM structural domain 4012 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 4015 Lysyl oxidase 4016 Lysyl oxidase 4017 Lysyl oxidase 4018 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 4023 Lipase 4023 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 4025 Animal haem peroxidase 4033 Uncharacterized ACR, COG1579 4035 Low-density lipoprotein receptor domain class A 4035 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 4036 Low-density lipoprotein receptor domain class A 4036 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 4038 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 4040 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 4041 Low-density lipoprotein receptor domain class A 4041 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 4046 Caldesmon 4048 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 4050 TNF(Tumor Necrosis Factor) family 4051 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 4052 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 4053 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 4054 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 4055 TNFR/NGFR cysteine-rich region 4056 MAPEG family 4056 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 4057 Transferrin 4061 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 4062 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 4064 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 4065 Lectin C-type domain. This family includes both long and short form C-type 4067 SH2 domain 4067 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 4070 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 4072 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 4074 Cation-dependent mannose-6-phosphate receptor 4077 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 4077 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 4077 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 4081 Mab-21 protein 4084 Helix-loop-helix DNA-binding domain 4085 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 4086 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 4086 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 4087 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 4087 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 4088 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 4088 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 4089 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 4089 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 4090 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 4090 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 4092 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 4092 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 4093 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 4093 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 4094 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 4097 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 4100 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4101 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4101 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4101 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4102 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4103 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4104 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4105 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4105 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4107 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4108 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4109 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4110 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4111 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4112 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4112 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4112 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4113 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4114 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4115 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4116 Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the 4121 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 4122 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 4124 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 4125 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 4128 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 4129 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 4133 Tau and MAP protein, tubulin-binding repeat 4133 Tau and MAP protein, tubulin-binding repeat 4133 Tau and MAP protein, tubulin-binding repeat 4133 Tau and MAP protein, tubulin-binding repeat 4134 Tau and MAP protein, tubulin-binding repeat 4134 Tau and MAP protein, tubulin-binding repeat 4134 Tau and MAP protein, tubulin-binding repeat 4134 Tau and MAP protein, tubulin-binding repeat 4137 Tau and MAP protein, tubulin-binding repeat 4137 Tau and MAP protein, tubulin-binding repeat 4137 Tau and MAP protein, tubulin-binding repeat 4137 Tau and MAP protein, tubulin-binding repeat 4139 Kinase associated domain 1 4140 Kinase associated domain 1 4141 WHEP-TRS domain 4142 7 transmembrane receptor (rhodopsin family) 4145 SH2 domain 4145 Protein kinase domain 4145 SH2 domain 4145 Protein kinase domain 4145 SH2 domain 4145 Protein kinase domain 4146 von Willebrand factor type A domain 4147 von Willebrand factor type A domain 4147 von Willebrand factor type A domain 4147 von Willebrand factor type A domain 4147 von Willebrand factor type A domain 4148 von Willebrand factor type A domain 4149 Helix-loop-helix DNA-binding domain 4149 Helix-loop-helix DNA-binding domain 4149 Helix-loop-helix DNA-binding domain 4149 Helix-loop-helix DNA-binding domain 4149 Helix-loop-helix DNA-binding domain 4151 Globin 4152 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 4152 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha 4152 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 4152 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha 4152 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 4152 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha 4152 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 4152 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha 4152 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 4152 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha 4153 Lectin C-type domain. This family includes both long and short form C-type 4155 Myelin basic protein 4157 7 transmembrane receptor (rhodopsin family) 4158 7 transmembrane receptor (rhodopsin family) 4159 7 transmembrane receptor (rhodopsin family) 4160 7 transmembrane receptor (rhodopsin family) 4161 7 transmembrane receptor (rhodopsin family) 4163 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 4168 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 4170 Apoptosis regulator proteins, Bcl-2 family 4170 Apoptosis regulator proteins, Bcl-2 family 4173 MCM2/3/5 family 4175 MCM2/3/5 family 4176 MCM2/3/5 family 4176 MCM2/3/5 family 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4179 Sushi domain (SCR repeat) 4185 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 4185 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 4185 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 4185 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 4192 PTN/MK heparin-binding protein family 4193 p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do 4193 p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do 4194 p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do 4199 Malic enzyme, NAD binding domain 4205 SRF-type transcription factor (DNA-binding and dimerisation domain) 4207 SRF-type transcription factor (DNA-binding and dimerisation domain) 4208 SRF-type transcription factor (DNA-binding and dimerisation domain) 4209 SRF-type transcription factor (DNA-binding and dimerisation domain) 4210 B-box zinc finger 4210 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 4214 Protein kinase domain 4218 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 4222 Homeobox domain 4223 Homeobox domain 4224 MAM domain. An extracellular domain found in many receptors 4224 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 4224 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 4224 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 4225 MAM domain. An extracellular domain found in many receptors 4225 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 4225 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 4233 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 4233 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 4233 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 4234 Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity 4234 Putative methyltransferase 4240 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 4240 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 4241 Transferrin 4241 Transferrin 4242 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 4245 GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o 4246 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 4250 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 4254 Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site 4254 Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site 4255 6-O-methylguanine DNA methyltransferase, DNA binding domain. This domain is a 3 helical bundle 4257 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 4257 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 4257 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 4257 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 4258 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 4259 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 4276 Class I Histocompatibility antigen, domains alpha 1 and 2 4277 Class I Histocompatibility antigen, domains alpha 1 and 2 4282 Macrophage migration inhibitory factor (MIF) 4283 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 4284 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 4285 Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but fun 4286 Helix-loop-helix DNA-binding domain 4287 Josephin 4287 Josephin 4288 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 4289 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 4289 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 4291 3'-5' exonuclease 4293 Protein kinase domain 4297 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 4297 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 4297 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 4298 YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity 4300 YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity 4301 DIL domain. The DIL domain has no known function 4301 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 4301 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 4306 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 4306 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 4308 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 4312 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4312 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4312 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4313 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4313 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4313 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4314 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4314 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4314 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4316 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4316 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4317 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4317 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4317 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4318 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4318 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4319 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4319 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4319 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4320 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4320 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4321 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4321 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4321 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4322 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4322 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4322 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4323 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4323 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4323 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4324 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4324 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4324 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4325 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4325 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4325 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4325 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4325 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4325 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4326 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4326 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4326 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 4327 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 4328 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 4329 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 4332 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 4332 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 4335 Helix-loop-helix DNA-binding domain 4338 MoaE protein. This family contains the MoaE protein that is involved in biosynthesis of molybdopterin. Molybdopterin, the universal component of the pterin molybdenum cofactors, contains a dithiolene group serving to bind Mo. Addition of the dithiolene s 4338 ThiS family. ThiS (thiaminS) is a 66 aa protein involved in sulphur transfer. ThiS is coded in the thiCEFSGH operon in E. coli. This family of proteins have two conserved Glycines at the COOH terminus. Thiocarboxylate is formed at the last G in the activ 4342 Protein kinase domain 4350 Methylpurine-DNA glycosylase (MPG). Methylpurine-DNA glycosylase is a base excision-repair protein. It is responsible for the hydrolysis of the deoxyribose N-glycosidic bond, excising 3-methyladenine and 3-methylguanine from damaged DNA 4351 Phosphomannose isomerase type I 4352 Fibronectin type III domain 4353 Animal haem peroxidase 4354 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 4355 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 4358 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 4360 Lectin C-type domain. This family includes both long and short form C-type 4361 Mer11 DNA-binding domain. The Mre11 complex is a multisubunit nuclease that is composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved in checkpoint signalling and DNA replication. Mre11 has an intrinsic DNA-binding activity that is stimulated by Rad50 o 4361 Mer11 DNA-binding domain. The Mre11 complex is a multisubunit nuclease that is composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved in checkpoint signalling and DNA replication. Mre11 has an intrinsic DNA-binding activity that is stimulated by Rad50 o 4363 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 4363 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 4430 Myosin head (motor domain) 4478 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4478 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4481 Macrophage scavenger receptor 4481 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 4481 Macrophage scavenger receptor 4481 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 4481 Macrophage scavenger receptor 4481 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 4485 Trypsin 4485 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 4486 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 4486 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 4486 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 4487 Homeobox domain 4488 Homeobox domain 4507 Phosphorylase family 2 4508 ATP synthase A chain 4512 Cytochrome C and Quinol oxidase polypeptide I 4513 Cytochrome C oxidase subunit II, periplasmic domain 4521 NUDIX domain 4522 Formate--tetrahydrofolate ligase 4524 Methylenetetrahydrofolate reductase. This family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from bacteria and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The structure for this domain is known to be a TIM barr 4534 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 4535 NADH dehydrogenase 4536 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 4537 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 4539 NADH-ubiquinone/plastoquinone oxidoreductase chain 4L 4541 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 4543 7 transmembrane receptor (rhodopsin family) 4544 7 transmembrane receptor (rhodopsin family) 4547 Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitell 4552 Flavodoxin 4552 Flavodoxin 4582 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 4582 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 4582 Structural protein 2. This family represents structural protein 2 of the hepatitis E virus. The high basic amino acid content of this protein has lead to the suggestion of a role in viral genomic RNA encapsidation 4583 von Willebrand factor type D domain 4583 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 4583 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerbe 4587 von Willebrand factor type D domain 4587 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 4591 B-box zinc finger 4593 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 4594 Methylmalonyl-CoA mutase. The enzyme methylmalonyl-CoA mutase is a member of a class of enzymes that uses coenzyme B12 (adenosylcobalamin) as a cofactor. The enzyme induces the formation of an adenosyl radical from the cofactor. This radical then initiat 4599 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 4600 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 4603 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 4609 Helix-loop-helix DNA-binding domain 4609 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 4610 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 4611 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 4613 Helix-loop-helix DNA-binding domain 4613 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 4615 Death domain 4616 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 4617 Helix-loop-helix DNA-binding domain 4617 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 4618 Helix-loop-helix DNA-binding domain 4618 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 4619 Myosin head (motor domain) 4620 Myosin head (motor domain) 4621 Myosin head (motor domain) 4622 Myosin head (motor domain) 4624 Myosin head (motor domain) 4624 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 4625 Myosin head (motor domain) 4626 Myosin head (motor domain) 4627 Myosin head (motor domain) 4627 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 4628 Myosin head (motor domain) 4628 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4628 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 4628 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 4629 Myosin head (motor domain) 4629 Myosin head (motor domain) 4638 Fibronectin type III domain 4638 Fibronectin type III domain 4638 Fibronectin type III domain 4638 Fibronectin type III domain 4638 Fibronectin type III domain 4638 Fibronectin type III domain 4638 Fibronectin type III domain 4642 Myosin head (motor domain) 4644 Myosin head (motor domain) 4648 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 4648 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4649 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 4650 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 4651 PH domain. PH stands for pleckstrin homology 4651 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4654 Helix-loop-helix DNA-binding domain 4654 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 4656 Helix-loop-helix DNA-binding domain 4656 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 4661 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 4668 Melibiase 4673 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 4673 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 4674 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 4675 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 4676 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 4681 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 4681 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 4683 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 4683 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 4684 Fibronectin type III domain 4685 Fibronectin type III domain 4687 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 4688 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 4690 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 4692 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 4693 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 4703 Nebulin repeat 4729 Respiratory-chain NADH dehydrogenase 24 Kd subunit 4734 C2 domain 4734 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 4734 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 4735 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 4736 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 4738 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 4741 Intermediate filament protein 4744 Intermediate filament protein 4745 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 4747 Intermediate filament protein 4750 Protein kinase domain 4753 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 4756 Fibronectin type III domain 4760 Helix-loop-helix DNA-binding domain 4761 Helix-loop-helix DNA-binding domain 4762 Helix-loop-helix DNA-binding domain 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4771 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 4771 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 4774 CTF/NF-I family 4774 CTF/NF-I family 4781 CTF/NF-I family 4782 CTF/NF-I family 4784 CTF/NF-I family 4790 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 4791 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 4796 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 4801 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 4803 Nerve growth factor family 4804 TNFR/NGFR cysteine-rich region 4808 Helix-loop-helix DNA-binding domain 4811 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 4817 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 4820 Cyclophilin type peptidyl-prolyl cis-trans isomerase 4821 Homeobox domain 4824 Homeobox domain 4825 Homeobox domain 4829 7 transmembrane receptor (rhodopsin family) 4830 Nucleoside diphosphate kinase 4831 Nucleoside diphosphate kinase 4832 Nucleoside diphosphate kinase 4833 Nucleoside diphosphate kinases 4835 Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones 4837 NNMT/PNMT/TEMT family 4838 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 4839 NOL1/NOP2/sun family 4842 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 4842 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 4843 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 4843 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 4846 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 4848 NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5 4849 Not1 N-terminal domain, CCR4-Not complex component 4849 NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5 4852 Pancreatic hormone peptide 4853 Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved 4854 Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved 4860 Phosphorylase family 2 4862 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 4864 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 4868 Fibronectin type III domain 4869 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 4878 Atrial natriuretic peptide 4879 Atrial natriuretic peptide 4880 Atrial natriuretic peptide 4881 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 4882 Protein kinase domain 4882 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 4882 Protein kinase domain 4882 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 4883 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 4886 7 transmembrane receptor (rhodopsin family) 4887 7 transmembrane receptor (rhodopsin family) 4889 7 transmembrane receptor (rhodopsin family) 4891 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined 4892 Nebulin repeat 4897 Fibronectin type III domain 4901 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 4902 Transforming growth factor beta like domain 4905 Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o 4908 Nerve growth factor family 4909 Nerve growth factor family 4914 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 4915 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 4916 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 4917 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 4919 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 4920 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 4921 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 4923 7 transmembrane receptor (rhodopsin family) 4928 Conserved nucleoporin domain 4928 Conserved nucleoporin domain 4928 Conserved nucleoporin domain 4928 Conserved nucleoporin domain 4929 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 4929 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 4929 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 4929 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 4929 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 4929 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 4929 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 4929 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 4935 Ocular albinism type 1 protein 4939 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance t 4940 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance t 4943 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 4946 Ornithine decarboxylase antizyme 4947 Ornithine decarboxylase antizyme 4950 Occludin/ELL family 4952 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 4952 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 4953 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 4956 Hsp20/alpha crystallin family 4976 Dynamin family 4976 Dynamin family 4976 Dynamin family 4976 Dynamin family 4976 Dynamin family 4976 Dynamin family 4976 Dynamin family 4976 Dynamin family 4982 TNFR/NGFR cysteine-rich region 4985 7 transmembrane receptor (rhodopsin family) 4986 7 transmembrane receptor (rhodopsin family) 4987 7 transmembrane receptor (rhodopsin family) 4987 7 transmembrane receptor (rhodopsin family) 4988 7 transmembrane receptor (rhodopsin family) 4991 7 transmembrane receptor (rhodopsin family) 4992 7 transmembrane receptor (rhodopsin family) 4993 7 transmembrane receptor (rhodopsin family) 4994 7 transmembrane receptor (rhodopsin family) 4995 7 transmembrane receptor (rhodopsin family) 4999 Origin recognition complex subunit 2. All DNA replication initiation is driven by a single conserved eukaryotic initiator complex termed he origin recognition complex (ORC). The ORC is a six protein complex. The function of ORC is reviewed in 5004 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 5005 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 5007 Oxysterol-binding protein 5008 LIF / OSM family 5010 PMP-22/EMP/MP20/Claudin family 5013 Homeobox domain 5013 Otx1 transcription factor 5015 Homeobox domain 5015 Otx1 transcription factor 5015 Homeobox domain 5015 Otx1 transcription factor 5020 Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5 5021 7 transmembrane receptor (rhodopsin family) 5023 ATP P2X receptor 5024 ATP P2X receptor 5025 ATP P2X receptor 5025 ATP P2X receptor 5025 ATP P2X receptor 5026 ATP P2X receptor 5026 ATP P2X receptor 5026 ATP P2X receptor 5027 ATP P2X receptor 5027 ATP P2X receptor 5028 7 transmembrane receptor (rhodopsin family) 5029 7 transmembrane receptor (rhodopsin family) 5029 7 transmembrane receptor (rhodopsin family) 5029 7 transmembrane receptor (rhodopsin family) 5030 7 transmembrane receptor (rhodopsin family) 5031 7 transmembrane receptor (rhodopsin family) 5031 7 transmembrane receptor (rhodopsin family) 5031 7 transmembrane receptor (rhodopsin family) 5031 7 transmembrane receptor (rhodopsin family) 5032 7 transmembrane receptor (rhodopsin family) 5034 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 5036 metallopeptidase family M24 5037 Phosphatidylethanolamine-binding protein 5042 Poly-adenylate binding protein, unique domain 5045 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5046 Giardia variant-specific surface protein 5046 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5046 Giardia variant-specific surface protein 5046 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5046 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5046 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5046 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5046 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5046 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5046 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5047 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 5049 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu 5050 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu 5051 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 5053 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 5054 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 5058 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 5062 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 5063 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 5064 Paralemmin 5066 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 5066 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 5066 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 5066 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 5066 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 5066 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 5066 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 5066 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 5067 Fibronectin type III domain 5067 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 5068 Lectin C-type domain. This family includes both long and short form C-type 5068 Lectin C-type domain. This family includes both long and short form C-type 5068 Lectin C-type domain. This family includes both long and short form C-type 5071 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 5071 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 5071 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 5071 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 5071 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 5071 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 5073 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 5075 'Paired box' domain 5076 'Paired box' domain 5076 'Paired box' domain 5077 'Paired box' domain 5077 'Paired box' domain 5077 Homeobox domain 5077 'Paired box' domain 5077 Homeobox domain 5077 'Paired box' domain 5077 Homeobox domain 5077 'Paired box' domain 5077 Homeobox domain 5077 'Paired box' domain 5077 Homeobox domain 5077 'Paired box' domain 5078 Homeobox domain 5078 'Paired box' domain 5079 'Paired box' domain 5080 Homeobox domain 5080 'Paired box' domain 5080 Homeobox domain 5080 'Paired box' domain 5081 Homeobox domain 5081 'Paired box' domain 5081 Homeobox domain 5081 'Paired box' domain 5082 Phosducin 5083 'Paired box' domain 5087 Homeobox domain 5087 PBX domain. The PBX domain is a bipartite acidic domain 5089 Homeobox domain 5089 PBX domain. The PBX domain is a bipartite acidic domain 5090 Homeobox domain 5090 PBX domain. The PBX domain is a bipartite acidic domain 5092 Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha) 5096 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 5099 Cadherin domain 5100 Cadherin domain 5100 Cadherin domain 5101 Cadherin domain 5104 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 5105 Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate 5106 Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate 5110 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 5111 Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 5111 Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 5118 CUB domain 5122 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic 5122 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5125 Giardia variant-specific surface protein 5125 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic 5125 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5126 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 5129 Protein kinase domain 5132 Phosducin 5132 Phosducin 5134 MYND finger 5134 Programmed cell death protein 2, C-terminal domain 5134 MYND finger 5134 Programmed cell death protein 2, C-terminal domain 5136 3'5'-cyclic nucleotide phosphodiesterase 5137 3'5'-cyclic nucleotide phosphodiesterase 5138 3'5'-cyclic nucleotide phosphodiesterase 5139 3'5'-cyclic nucleotide phosphodiesterase 5140 3'5'-cyclic nucleotide phosphodiesterase 5141 3'5'-cyclic nucleotide phosphodiesterase 5142 3'5'-cyclic nucleotide phosphodiesterase 5143 3'5'-cyclic nucleotide phosphodiesterase 5144 3'5'-cyclic nucleotide phosphodiesterase 5145 3'5'-cyclic nucleotide phosphodiesterase 5146 3'5'-cyclic nucleotide phosphodiesterase 5150 3'5'-cyclic nucleotide phosphodiesterase 5150 3'5'-cyclic nucleotide phosphodiesterase 5151 3'5'-cyclic nucleotide phosphodiesterase 5152 3'5'-cyclic nucleotide phosphodiesterase 5153 3'5'-cyclic nucleotide phosphodiesterase 5154 Platelet-derived growth factor (PDGF) 5154 Platelet-derived growth factor (PDGF) 5155 Platelet-derived growth factor (PDGF) 5155 Platelet-derived growth factor (PDGF) 5158 3'5'-cyclic nucleotide phosphodiesterase 5160 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 5161 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 5163 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 5164 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 5165 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 5166 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 5167 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 5168 Somatomedin B domain 5168 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 5169 Somatomedin B domain 5169 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 5172 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 5173 Vertebrate endogenous opioids neuropeptide 5179 Vertebrate endogenous opioids neuropeptide 5184 metallopeptidase family M24 5194 Peroxin 13, N-terminal domain. Both termini of the Peroxin-13 are oriented to the cytosol. Peroxin-13 is required for peroxisomal association of peroxin-14 5196 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 5197 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 5198 AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt 5198 AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d 5199 Thrombospondin type 1 domain 5201 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 5202 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 5204 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 5204 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 5204 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 5207 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 5207 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 5208 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 5209 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 5210 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 5210 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 5211 Phosphofructokinase 5212 LCCL domain 5212 von Willebrand factor type A domain 5213 Phosphofructokinase 5214 Phosphofructokinase 5217 Profilin 5222 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 5225 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 5230 Phosphoglycerate kinase 5232 Phosphoglycerate kinase 5241 Progesterone receptor 5241 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5245 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 5250 Mitochondrial carrier protein 5250 Mitochondrial carrier protein 5252 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 5252 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 5253 jmjC domain 5253 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 5253 jmjC domain 5257 Glycosyl hydrolases family 15 5265 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 5268 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 5269 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 5270 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 5275 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 5286 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 5286 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 5287 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 5287 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 5288 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 5288 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 5289 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 5290 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 5290 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 5291 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 5291 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 5292 Protein kinase domain 5293 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 5293 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 5294 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 5294 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 5295 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 5295 SH2 domain 5295 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 5295 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 5296 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 5300 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 5300 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 5301 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 5303 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 5305 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata 5306 Phosphatidylinositol transfer protein 5307 Homeobox domain 5308 Homeobox domain 5308 Homeobox domain 5308 Homeobox domain 5309 Homeobox domain 5310 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 5310 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 5310 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 5310 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 5310 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c 5311 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 5313 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 5313 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 5314 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 5314 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 5319 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 5320 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 5322 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 5325 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 5327 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 5327 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 5327 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 5328 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 5330 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 5330 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 5331 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 5331 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 5332 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 5332 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 5333 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 5335 SH2 domain 5335 PH domain. PH stands for pleckstrin homology 5335 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 5335 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 5335 SH2 domain 5335 PH domain. PH stands for pleckstrin homology 5335 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 5335 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 5336 SH2 domain 5336 PH domain. PH stands for pleckstrin homology 5336 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 5337 PX domain. PX domains bind to phosphoinositides 5338 PX domain. PX domains bind to phosphoinositides 5339 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 5340 Trypsin 5340 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 5341 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 5342 PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some 5345 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 5346 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 5347 POLO box duplicated region 5348 ATP1G1/PLM/MAT8 family 5348 ATP1G1/PLM/MAT8 family 5349 ATP1G1/PLM/MAT8 family 5349 ATP1G1/PLM/MAT8 family 5350 Phospholamban. The regulation of calcium levels across the membrane of the sarcoplasmic reticulum involves the interplay of many membrane proteins. Phospholamban is a 52 residue integral membrane protein that is involved in reversibly inhibiting the Ca(2 5354 Myelin proteolipid protein (PLP or lipophilin) 5359 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 5360 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 5360 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 5361 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 5361 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 5361 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 5364 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 5364 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 5365 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 5368 Vertebrate endogenous opioids neuropeptide 5371 B-box zinc finger 5371 Zinc finger, C3HC4 type (RING finger) 5371 Zinc finger 5371 Zinc finger 5371 Zinc finger 5372 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 5373 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 5375 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 5376 PMP-22/EMP/MP20/Claudin family 5376 PMP-22/EMP/MP20/Claudin family 5376 PMP-22/EMP/MP20/Claudin family 5380 Deuterolysin metalloprotease (M35) family 5394 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi 5396 Homeobox domain 5396 Homeobox domain 5406 Lipase 5406 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 5407 Lipase 5407 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 5408 Lipase 5408 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 5409 NNMT/PNMT/TEMT family 5412 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 5413 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 5414 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 5414 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 5414 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 5422 DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities 5427 DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme con 5428 DNA polymerase family A 5430 RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family 5430 RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu 5431 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com 5433 RNA polymerase Rpb4 5434 RNA polymerase Rpb5, C-terminal domain. The assembly domain of Rpb5. The archaeal equivalent to this domain is subunit H. Subunit H lacks the N-terminal domain 5434 RNA polymerase Rpb5, N-terminal domain. Rpb5 has a bipartite structure which includes a eukaryote-specific N-terminal domain and a C-terminal domain resembling the archaeal RNAP subunit H. The N-terminal domain is involved in DNA binding and is part of t 5435 RNA polymerase Rpb6. Rpb6 is an essential subunit in the eukaryotic polymerases Pol I, II and III. The bacterial equivalent to Rpb6 is the omega subunit. Rpb6 and omega are structurally conserved and both function in polymerase assembly 5437 RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits 5439 RNA polymerases L / 13 to 16 kDa subunit 5443 Corticotropin ACTH domain 5444 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. 5445 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. 5446 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. 5451 Homeobox domain 5452 Homeobox domain 5457 Pou domain - N-terminal to homeobox domain 5458 Pou domain - N-terminal to homeobox domain 5459 Pou domain - N-terminal to homeobox domain 5460 Homeobox domain 5464 Inorganic pyrophosphatase 5465 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5465 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5467 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5467 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5467 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5467 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5468 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5468 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5468 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5468 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5468 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5468 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5468 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5468 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5471 Glutamine amidotransferases class-II 5473 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 5476 Serine carboxypeptidase 5478 Cyclophilin type peptidyl-prolyl cis-trans isomerase 5479 Cyclophilin type peptidyl-prolyl cis-trans isomerase 5480 Cyclophilin type peptidyl-prolyl cis-trans isomerase 5481 Cyclophilin type peptidyl-prolyl cis-trans isomerase 5496 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 5496 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 5506 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 5507 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 5509 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 5511 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 5514 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 5518 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 5519 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 5519 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 5524 Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is 5524 Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is 5524 Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is 5524 Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is 5524 Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is 5525 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5526 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5527 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5527 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5527 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5527 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5528 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5528 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5528 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5529 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 5538 Palmitoyl protein thioesterase 5539 Pancreatic hormone peptide 5540 7 transmembrane receptor (rhodopsin family) 5547 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 5551 C2 domain 5551 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 5557 DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits 5558 Eukaryotic-type DNA primase, large subunit. DNA primase is the polymerase that synthesises small RNA primers for the Okazaki fragments made during discontinuous DNA replication. DNA primase is a heterodimer of two subunits, the small subunit Pri1 (48 kDa 5564 pfam02922, isoamylase_N, Isoamylase N-terminal domain. This domain is found in a range of enzymes that act on branched substrates - isoamylase, pullulanase and branching enzyme. This family also contains the beta subunit of 5' AMP activated kinase 5569 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 5569 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 5570 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 5570 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 5570 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 5573 Cyclic nucleotide-binding domain 5576 Cyclic nucleotide-binding domain 5578 C2 domain 5578 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5579 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5580 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5581 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5582 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5583 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5584 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5585 Hr1 repeat 5586 Hr1 repeat 5587 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5588 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5590 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5591 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 5591 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 5595 Protein kinase domain 5617 Somatotropin hormone family 5623 Transforming growth factor beta like domain 5625 Proline dehydrogenase 5626 Homeobox domain 5627 Laminin G domain 5627 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 5630 Intermediate filament protein 5641 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this 5648 Trypsin 5648 CUB domain 5648 CUB domain 5649 Reeler domain 5649 Reeler domain 5651 Trypsin 5651 CUB domain 5651 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 5651 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 5657 Trypsin 5660 Saposin A-type domain 5662 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 5663 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 5663 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 5663 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 5664 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 5664 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 5689 Proteasome A-type and B-type 5690 Proteasome A-type and B-type 5691 Proteasome A-type and B-type 5692 Proteasome A-type and B-type 5697 Pancreatic hormone peptide 5698 Proteasome A-type and B-type 5698 Proteasome A-type and B-type 5713 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 5720 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 5720 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 5720 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 5720 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 5721 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 5721 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 5724 7 transmembrane receptor (rhodopsin family) 5725 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 5725 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 5725 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 5727 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 5729 7 transmembrane receptor (rhodopsin family) 5730 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 5731 7 transmembrane receptor (rhodopsin family) 5732 7 transmembrane receptor (rhodopsin family) 5733 7 transmembrane receptor (rhodopsin family) 5734 7 transmembrane receptor (rhodopsin family) 5737 7 transmembrane receptor (rhodopsin family) 5739 7 transmembrane receptor (rhodopsin family) 5740 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 5741 Parathyroid hormone family 5742 Animal haem peroxidase 5742 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 5742 Animal haem peroxidase 5742 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 5743 Animal haem peroxidase 5743 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 5744 Parathyroid hormone family 5745 7 transmembrane receptor (Secretin family) 5746 7 transmembrane receptor (Secretin family) 5747 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 5747 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 5764 PTN/MK heparin-binding protein family 5768 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 5774 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 5774 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 5775 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 5775 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 5777 SH2 domain 5777 SH2 domain 5777 SH2 domain 5780 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 5780 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 5781 SH2 domain 5783 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 5783 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 5783 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 5783 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 5786 Protein-tyrosine phosphatase 5787 Fibronectin type III domain 5788 Fibronectin type III domain 5788 Fibronectin type III domain 5788 Fibronectin type III domain 5789 Fibronectin type III domain 5789 Fibronectin type III domain 5789 Fibronectin type III domain 5789 Fibronectin type III domain 5792 Fibronectin type III domain 5792 Fibronectin type III domain 5793 Eukaryotic-type carbonic anhydrase 5795 Fibronectin type III domain 5796 Fibronectin type III domain 5797 Fibronectin type III domain 5800 Fibronectin type III domain 5800 Fibronectin type III domain 5802 Fibronectin type III domain 5802 Fibronectin type III domain 5802 Fibronectin type III domain 5802 Fibronectin type III domain 5803 Fibronectin type III domain 5803 Eukaryotic-type carbonic anhydrase 5805 6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. 5810 Repair protein Rad1/Rec1/Rad17 5810 Repair protein Rad1/Rec1/Rad17 5810 Repair protein Rad1/Rec1/Rad17 5822 Periodic tryptophan protein 2 WD repeat associated domain 5827 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 5829 Paxillin family 5829 LIM domain. This family represents two copies of the LIM structural domain 5831 Delta 1-pyrroline-5-carboxylate reductase 5831 Delta 1-pyrroline-5-carboxylate reductase 5834 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 5836 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 5837 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 5858 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 5858 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 5859 tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyl 5861 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 5863 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 5868 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 5869 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 5873 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 5874 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 5878 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 5883 Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage 5884 Rad17 cell cycle checkpoint protein 5884 Rad17 cell cycle checkpoint protein 5884 Rad17 cell cycle checkpoint protein 5884 Rad17 cell cycle checkpoint protein 5884 Rad17 cell cycle checkpoint protein 5884 Rad17 cell cycle checkpoint protein 5884 Rad17 cell cycle checkpoint protein 5884 Rad17 cell cycle checkpoint protein 5887 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 5887 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 5888 Helix-hairpin-helix motif 5893 Rad52/22 family double-strand break repair protein 5893 Rad52/22 family double-strand break repair protein 5893 Rad52/22 family double-strand break repair protein 5893 Rad52/22 family double-strand break repair protein 5894 Protein kinase domain 5894 Raf-like Ras-binding domain 5894 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 5897 Recombination activating protein 2. V-D-J recombination is the combinatorial process by which the huge range of immunoglobulin and T cell binding specificity is generated from a limited amount of genetic material. This process is synergistically activate 5900 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 5902 RanBP1 domain 5903 RanBP1 domain 5903 Zn-finger in Ran binding protein and others 5903 Cyclophilin type peptidyl-prolyl cis-trans isomerase 5909 Rap/ran-GAP 5914 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5914 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5915 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5915 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5915 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5915 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5916 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 5916 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 5920 NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins 5921 SH2 domain 5921 PH domain. PH stands for pleckstrin homology 5921 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 5921 SH2 domain 5921 PH domain. PH stands for pleckstrin homology 5921 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 5922 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 5923 Guanine nucleotide exchange factor for Ras-like GTPases 5923 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 5924 PH domain. PH stands for pleckstrin homology 5924 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 5924 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 5925 Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted 5925 Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve 5926 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 5926 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 5926 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 5927 jmjC domain 5927 jmjN domain 5927 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 5927 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 5927 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 5932 Ezrin/radixin/moesin family 5933 Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted 5933 Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve 5934 Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted 5934 Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve 5936 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol 5947 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 5948 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 5949 Interphotoreceptor retinoid-binding protein. Interphotoreceptor retinoid-binding protein (IRBP) mediates retinoid trafficking between the photoreceptors and pigmented epithelium. This Pfam family represents a repeat domain found four times in the mammali 5950 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 5956 7 transmembrane receptor (rhodopsin family) 5956 7 transmembrane receptor (rhodopsin family) 5956 C.elegans Srg family integral membrane protein 5959 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 5961 Tetraspanin family 5962 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 5962 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 5966 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 5970 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 5971 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 5972 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 5977 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 5979 Cadherin domain 5979 Cadherin domain 5979 Cadherin domain 5979 Cadherin domain 5981 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 5986 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 5987 B-box zinc finger 5987 B-box zinc finger 5989 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 5990 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 5990 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 5991 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 5991 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 5992 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 5993 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 5995 7 transmembrane receptor (rhodopsin family) 5996 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 5997 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 5998 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 5998 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 5999 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 6000 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 6000 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 6002 Raf-like Ras-binding domain 6002 Phosphotyrosine interaction domain (PTB/PID) 6005 Ammonium Transporter Family 6006 Ammonium Transporter Family 6006 Ammonium Transporter Family 6006 Ammonium Transporter Family 6006 Ammonium Transporter Family 6006 Ammonium Transporter Family 6007 Ammonium Transporter Family 6007 Ammonium Transporter Family 6010 7 transmembrane receptor (rhodopsin family) 6011 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 6013 Insulin/IGF/Relaxin family. Superfamily includes insulins 6017 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 6017 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 6019 Insulin/IGF/Relaxin family. Superfamily includes insulins 6019 Insulin/IGF/Relaxin family. Superfamily includes insulins 6041 Protein kinase domain 6050 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 6051 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 6091 Fibronectin type III domain 6091 Fibronectin type III domain 6092 Fibronectin type III domain 6092 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 6094 Tetraspanin family 6095 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6095 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6095 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6095 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6095 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6095 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6095 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6095 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6096 metallopeptidase family M24 6096 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6096 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6097 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6097 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6098 Fibronectin type III domain 6117 Replication factor-A protein 1, N-terminal domain 6121 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc 6122 Ribosomal protein L3 6123 Ribosomal protein L3 6124 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 6125 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 6128 Ribosomal protein L6e 6128 Ribosomal protein L6, N-terminal domain 6129 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 6130 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 6132 Ribosomal Proteins L2 6132 Ribosomal Proteins L2 6134 Ribosomal L10 6137 Ribosomal protein L13e 6137 Ribosomal protein L13e 6138 Ribosomal L15 6139 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 6141 Eukaryotic ribosomal protein L18 6142 Ribosomal L18ae protein family 6143 Ribosomal protein L19e 6144 Ribosomal protein L21e 6146 Ribosomal L22e protein family 6150 Ribosomal protein L23 6152 Ribosomal protein L24e 6155 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 6158 Ribosomal L28e protein family 6159 Ribosomal L29e protein family 6160 Ribosomal protein L31e 6161 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 6164 Ribosomal protein L34e 6164 Ribosomal protein L34e 6165 Ribosomal protein L35Ae 6166 Ribosomal protein L44 6167 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 6168 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 6169 Ribosomal L38e protein family 6173 Ribosomal protein L44 6175 Ribosomal protein L10 6175 Ribosomal protein L10 6183 Ribosomal protein S12 6183 Ribosomal protein S12 6183 Ribosomal protein S12 6189 Ribosomal S3Ae family 6189 Ribosomal S3Ae family 6193 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 6194 Ribosomal protein S6e 6201 Ribosomal protein S7e 6202 Ribosomal protein S8e 6204 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 6205 Ribosomal protein S17 6206 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 6207 Ribosomal protein S15 6210 Ribosomal protein S8 6217 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 6218 Ribosomal S17 6222 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 6223 Ribosomal protein S19e 6224 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 6227 Ribosomal protein S21e 6230 S25 ribosomal protein 6231 Ribosomal protein S26e 6232 Ribosomal protein S27 6233 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 6233 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis o 6234 Ribosomal protein S28e 6240 Ribonucleotide reductase, barrel domain 6241 Ribonucleotide reductase, small chain 6247 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 6249 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 6252 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 6253 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 6256 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6256 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6257 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6257 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6258 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 6258 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 6261 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 6261 RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown 6261 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 6261 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 6262 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 6262 RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown 6262 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 6262 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 6263 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 6263 RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown 6263 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 6263 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 6271 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6273 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6274 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6275 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6275 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6276 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6277 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6278 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6279 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6280 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6281 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6282 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6283 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6284 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6285 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6286 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 6288 Serum amyloid A protein 6289 Serum amyloid A protein 6291 Serum amyloid A protein 6293 Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 6293 Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 6294 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 6295 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 6295 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 6299 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 6302 Tetraspanin family 6303 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 6304 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 6307 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. 6309 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. 6320 Lectin C-type domain. This family includes both long and short form C-type 6322 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 6323 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6324 Immunoglobulin domain 6326 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6328 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6329 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6331 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6332 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6334 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6335 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6336 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 6337 Amiloride-sensitive sodium channel 6338 Amiloride-sensitive sodium channel 6339 Amiloride-sensitive sodium channel 6340 Amiloride-sensitive sodium channel 6342 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 6343 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 6344 7 transmembrane receptor (Secretin family) 6344 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 6346 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6347 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6348 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6349 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6351 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6354 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6355 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6369 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6372 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6373 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6374 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6382 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 6383 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 6385 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 6387 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 6391 Succinate dehydrogenase cytochrome b subunit 6397 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 6397 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 6401 Lectin C-type domain. This family includes both long and short form C-type 6402 Lectin C-type domain. This family includes both long and short form C-type 6403 Sushi domain (SCR repeat) 6403 Lectin C-type domain. This family includes both long and short form C-type 6405 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 6418 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 6419 Transposase. This family includes the mariner transposase 6419 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 6423 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 6423 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 6424 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 6426 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 6435 Lectin C-type domain. This family includes both long and short form C-type 6436 Lectin C-type domain. This family includes both long and short form C-type 6439 Surfactant protein B 6439 Saposin A-type domain 6440 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 6441 Lectin C-type domain. This family includes both long and short form C-type 6441 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 6448 Sulfatase 6462 Laminin G domain 6464 SH2 domain 6464 Phosphotyrosine interaction domain (PTB/PID) 6469 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 6469 Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation 6470 Serine hydroxymethyltransferase 6470 Serine hydroxymethyltransferase 6472 Serine hydroxymethyltransferase 6473 Homeobox domain 6473 Homeobox domain 6474 Homeobox domain 6474 Homeobox domain 6476 Trefoil (P-type) domain 6476 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 6477 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 6478 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 6480 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6480 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6480 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6482 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6482 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6483 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6484 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6487 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6487 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6487 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6487 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6487 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6487 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6487 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6487 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6489 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 6494 Rap/ran-GAP 6494 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 6494 Rap/ran-GAP 6494 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 6497 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 6498 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 6505 Sodium:dicarboxylate symporter family 6506 Sodium:dicarboxylate symporter family 6507 Sodium:dicarboxylate symporter family 6508 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 6509 Sodium:dicarboxylate symporter family 6510 Sodium:dicarboxylate symporter family 6511 Sodium:dicarboxylate symporter family 6512 Sodium:dicarboxylate symporter family 6513 Sugar (and other) transporter 6514 Sugar (and other) transporter 6515 Sugar (and other) transporter 6517 Sugar (and other) transporter 6518 Sugar (and other) transporter 6521 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 6522 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 6523 Sodium:solute symporter family 6524 Sodium:solute symporter family 6525 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 6525 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 6525 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 6526 Sodium:solute symporter family 6527 Sodium:solute symporter family 6529 Sodium:neurotransmitter symporter family 6530 Sodium:neurotransmitter symporter family 6531 Sodium:neurotransmitter symporter family 6532 Sodium:neurotransmitter symporter family 6532 Serotonin (5-HT) neurotransmitter transporter, N-terminus 6533 Sodium:neurotransmitter symporter family 6534 Sodium:neurotransmitter symporter family 6535 Sodium:neurotransmitter symporter family 6536 Sodium:neurotransmitter symporter family 6538 Sodium:neurotransmitter symporter family 6539 Sodium:neurotransmitter symporter family 6540 Sodium:neurotransmitter symporter family 6543 Calx-beta domain 6543 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells 6546 Calx-beta domain 6546 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 6547 Calx-beta domain 6547 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 6547 Calx-beta domain 6547 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 6548 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 6549 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 6550 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 6553 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 6554 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 6555 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 6556 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined 6560 K-Cl Co-transporter type 1 (KCC1) 6561 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 6563 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 6564 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 6565 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 6569 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 6573 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog 6574 Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor 6575 Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor 6576 Mitochondrial carrier protein 6578 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 6578 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 6579 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 6579 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 6579 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 6579 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 6579 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 6579 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 6580 Sugar (and other) transporter 6580 Sugar (and other) transporter 6581 Sugar (and other) transporter 6582 Sugar (and other) transporter 6582 Sugar (and other) transporter 6583 Sugar (and other) transporter 6584 Sugar (and other) transporter 6585 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 6590 WAP-type (Whey Acidic Protein) 'four-disulfide core' 6594 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 6594 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 6595 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 6595 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 6596 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 6596 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 6597 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 6603 BAF60b domain of the SWIB complex 6604 BAF60b domain of the SWIB complex 6605 HMG (high mobility group) box 6608 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 6609 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 6610 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 6611 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 6620 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 6622 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 6622 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 6623 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 6626 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 6629 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 6632 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 6633 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 6633 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 6634 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 6637 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 6640 PH domain. PH stands for pleckstrin homology 6641 PH domain. PH stands for pleckstrin homology 6641 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 6642 PX domain. PX domains bind to phosphoinositides 6642 Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2 6642 PX domain. PX domains bind to phosphoinositides 6642 PX domain. PX domains bind to phosphoinositides 6642 Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2 6643 PX domain. PX domains bind to phosphoinositides 6643 Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2 6645 PH domain. PH stands for pleckstrin homology 6647 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 6648 Iron/manganese superoxide dismutases, C-terminal domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one. 6649 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 6651 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6651 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6651 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6653 Fibronectin type III domain 6653 Low-density lipoprotein receptor domain class A 6655 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 6656 HMG (high mobility group) box 6657 HMG (high mobility group) box 6657 HMG (high mobility group) box 6658 HMG (high mobility group) box 6659 HMG (high mobility group) box 6660 HMG (high mobility group) box 6660 HMG (high mobility group) box 6660 HMG (high mobility group) box 6662 HMG (high mobility group) box 6663 HMG (high mobility group) box 6664 HMG (high mobility group) box 6665 HMG (high mobility group) box 6666 HMG (high mobility group) box 6667 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 6668 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 6668 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 6670 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 6671 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 6672 HMG (high mobility group) box 6672 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 6672 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 6675 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uri 6677 Hyaluronidase 6677 Hyaluronidase 6678 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 6683 MIT domain 6688 Ets-domain 6690 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 6691 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 6694 Cystatin domain 6695 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 6696 Osteopontin 6697 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 6699 Cornifin (SPRR) family 6707 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 6708 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 6708 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 6709 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 6709 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 6710 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 6711 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 6711 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 6712 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 6714 SH2 domain 6714 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 6715 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 6716 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 6718 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 6720 Helix-loop-helix DNA-binding domain 6721 Helix-loop-helix DNA-binding domain 6722 SRF-type transcription factor (DNA-binding and dimerisation domain) 6723 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 6727 Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding 6728 SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. Th 6734 Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of 6736 HMG (high mobility group) box 6737 B-box zinc finger 6737 Zinc finger, C3HC4 type (RING finger) 6742 Single-strand binding protein family 6745 Translocon-associated protein (TRAP), alpha subunit. The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across 6749 HMG (high mobility group) box 6749 Structure-specific recognition protein 6750 Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som 6751 7 transmembrane receptor (rhodopsin family) 6752 7 transmembrane receptor (rhodopsin family) 6753 7 transmembrane receptor (rhodopsin family) 6754 7 transmembrane receptor (rhodopsin family) 6755 7 transmembrane receptor (rhodopsin family) 6764 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 6764 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 6764 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 6764 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 6764 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 6764 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 6768 CUB domain 6769 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 6770 START domain 6772 SH2 domain 6772 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6772 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 6772 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6772 SH2 domain 6772 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6772 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 6772 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6773 SH2 domain 6773 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6773 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6773 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 6774 SH2 domain 6774 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6774 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 6774 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6774 SH2 domain 6774 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6774 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 6774 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6775 SH2 domain 6775 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6775 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 6775 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6776 SH2 domain 6776 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6776 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6777 SH2 domain 6777 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6777 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6778 SH2 domain 6778 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 6778 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 6778 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 6779 Statherin. Statherin functions biologically to inhibit the nucleation and growth of calcium phosphate minerals. The N-terminus of statherin is highly charge, the glutamic acids of which have been shown to be important in the recognition hydroxyapatite 6780 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6780 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6780 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6780 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6781 Stanniocalcin family 6782 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 6783 Sulfotransferase protein 6785 GNS1/SUR4 family 6799 Sulfotransferase protein 6799 Sulfotransferase protein 6804 Syntaxin 6809 Syntaxin 6812 Sec1 family 6813 Sec1 family 6814 Sec1 family 6817 Sulfotransferase protein 6817 Sulfotransferase protein 6817 Sulfotransferase protein 6817 Sulfotransferase protein 6817 Sulfotransferase protein 6818 Sulfotransferase protein 6818 Sulfotransferase protein 6819 Sulfotransferase protein 6819 Sulfotransferase protein 6820 Sulfotransferase proteins 6820 Sulfotransferase protein 6821 Mo-co oxidoreductase dimerisation domain. This domain is found in molybdopterin cofactor (Mo-co) oxidoreductases. It is involved in dimer formation, and has an Ig-fold structure 6821 Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity 6822 Sulfotransferase protein 6829 Supt5 repeat. This short region of similarity is found in two tandem copies in Supt5 proteins that are involved in chromatin regulation. The function of this region is unknown 6834 SURF1 family 6836 SURF4 family 6839 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 6840 Villin headpiece domain 6840 Villin headpiece domain 6843 Synaptobrevin 6843 Synaptobrevin 6844 Synaptobrevin 6845 Synaptobrevin 6850 SH2 domain 6853 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 6853 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 6854 N-terminal domain 6854 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 6854 N-terminal domain 6854 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 6855 Synaptophysin / synaptoporin 6856 Synaptophysin / synaptoporin 6856 Synaptophysin / synaptoporin 6857 C2 domain 6861 C2 domain 6862 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 6865 7 transmembrane receptor (rhodopsin family) 6866 Neurokinin B 6868 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 6868 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 6869 7 transmembrane receptor (rhodopsin family) 6869 7 transmembrane receptor (rhodopsin family) 6870 7 transmembrane receptor (rhodopsin family) 6880 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tra 6883 Transcription initiation factor TFIID subunit A 6884 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histon 6885 Protein kinase domain 6885 Protein kinase domain 6885 Protein kinase domain 6885 Protein kinase domain 6886 Helix-loop-helix DNA-binding domain 6888 Transaldolase 6895 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6895 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 6900 Fibronectin type III domain 6901 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 6901 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 6901 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 6902 Tubulin binding cofactor A 6905 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 6906 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 6915 7 transmembrane receptor (rhodopsin family) 6916 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 6916 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 6920 Transcription factor S-II (TFIIS) 6929 Helix-loop-helix DNA-binding domain 6932 HMG (high mobility group) box 6934 HMG (high mobility group) box 6935 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 6938 Helix-loop-helix DNA-binding domain 6941 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 6943 Helix-loop-helix DNA-binding domain 6944 Sodium transport protein 6945 Helix-loop-helix DNA-binding domain 6945 Helix-loop-helix DNA-binding domain 6945 Helix-loop-helix DNA-binding domain 6947 Eukaryotic cobalamin-binding protein 6948 Eukaryotic cobalamin-binding protein 6949 Treacher Collins syndrome protein Treacle 6950 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 6990 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 6991 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 6993 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 6996 Uracil DNA glycosylase superfamily 6999 Tryptophan 2,3-dioxygenase 7003 TEA/ATTS domain family 7004 TEA/ATTS domain family 7005 TEA/ATTS domain family 7006 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 7007 Zona pellucida-like domain 7007 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 7009 Uncharacterized protein family UPF0005 7010 Fibronectin type III domain 7010 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 7013 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 7013 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 7014 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 7016 Protein kinase domain 7018 Transferrin 7019 HMG (high mobility group) box 7020 Transcription factor AP-2 7021 Transcription factor AP-2 7022 Transcription factor AP-2 7023 Helix-loop-helix DNA-binding domain 7025 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 7025 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7026 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 7026 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7027 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 7029 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 7030 Helix-loop-helix DNA-binding domain 7031 Trefoil (P-type) domain 7032 Trefoil (P-type) domain 7033 Trefoil (P-type) domain 7036 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 7037 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 7038 Carboxylesterase 7038 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 7040 Transforming growth factor beta like domain 7040 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 7041 Paxillin family 7041 LIM domain. This family represents two copies of the LIM structural domain 7042 Transforming growth factor beta like domain 7042 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 7043 Transforming growth factor beta like domain 7043 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 7044 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 7045 Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria 7046 Protein kinase domain 7046 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 7047 Transglutaminase family 7047 Transglutaminase family, C-terminal ig like domain 7047 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 7048 Protein kinase domain 7049 Zona pellucida-like domain 7051 Transglutaminase family 7051 Transglutaminase family, C-terminal ig like domain 7051 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 7052 Transglutaminase family 7052 Transglutaminase family, C-terminal ig like domain 7052 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 7053 Transglutaminase family 7053 Transglutaminase family, C-terminal ig like domain 7053 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 7054 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 7057 Thrombospondin N-terminal -like domain 7057 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 7058 Thrombospondin type 1 domain 7058 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 7060 Thrombospondin N-terminal -like domain 7062 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 7066 Erythropoietin/thrombopoietin 7067 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7068 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 7068 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7074 Raf-like Ras-binding domain 7075 Fibronectin type III domain 7076 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 7077 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 7079 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 7080 Homeobox domain 7082 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 7082 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 7083 Thymidine kinase 7084 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 7088 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 7089 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 7090 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 7092 CUB domain 7092 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 7093 CUB domain 7093 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 7094 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 7095 Translocation protein Sec62 7096 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 7097 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 7098 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 7099 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 7099 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 7099 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 7099 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 7100 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 7101 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 7101 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7102 Tetraspanin family 7103 Tetraspanin family 7105 Tetraspanin family 7106 Tetraspanin family 7108 Ergosterol biosynthesis ERG4/ERG24 family 7111 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 7112 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 7113 Trypsin 7122 PMP-22/EMP/MP20/Claudin family 7124 TNF(Tumor Necrosis Factor) family 7126 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 7128 A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation 7130 CUB domain 7130 Extracellular link domain 7132 TNFR/NGFR cysteine-rich region 7133 TNFR/NGFR cysteine-rich region 7135 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 7136 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 7137 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 7142 Nuclear transition protein 2 7143 Fibronectin type III domain 7148 Fibronectin type III domain 7148 Fibrinogen beta and gamma chains, C-terminal globular domain 7148 Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up in 7148 Fibronectin type III domain 7148 Fibrinogen beta and gamma chains, C-terminal globular domain 7156 Topoisomerase DNA binding C4 zinc finger 7157 P53 7159 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 7161 P53 7163 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction 7164 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction 7165 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction 7166 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 7167 Triosephosphate isomerase 7168 Tropomyosin 7169 Tropomyosin 7170 Tropomyosin 7170 Tropomyosin 7171 Tropomyosin 7173 Animal haem peroxidase 7173 Animal haem peroxidase 7173 Animal haem peroxidase 7173 Animal haem peroxidase 7173 Animal haem peroxidase 7174 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 7176 Trypsin 7178 Translationally controlled tumor protein 7181 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 7181 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7182 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 7182 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7184 Hsp90 protein 7185 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7186 TRAF-type zinc finger 7186 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7186 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7187 TRAF-type zinc finger 7187 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7187 TRAF-type zinc finger 7187 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7187 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7188 TRAF-type zinc finger 7188 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7188 TRAF-type zinc finger 7188 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7189 TRAF-type zinc finger 7189 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7189 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 7189 TRAF-type zinc finger 7189 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 7189 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 7201 7 transmembrane receptor (rhodopsin family) 7203 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 7204 PH domain. PH stands for pleckstrin homology 7216 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 7216 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 7216 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 7216 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 7216 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 7220 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7222 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7223 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7224 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7225 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7226 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7227 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d 7247 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 7249 Tuberin 7249 Rap/ran-GAP 7249 Tuberin 7249 Rap/ran-GAP 7249 Tuberin 7249 Rap/ran-GAP 7253 7 transmembrane receptor (rhodopsin family) 7257 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 7258 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 7259 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 7274 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 7275 Tub family 7275 Tub family 7276 Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders 7287 Tub family 7288 Tub family 7289 Tub family 7291 Helix-loop-helix DNA-binding domain 7293 TNFR/NGFR cysteine-rich region 7294 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 7295 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 7298 Thymidylate synthase 7299 Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372 7301 Fibronectin type III domain 7306 Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372 7311 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 7311 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 7314 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 7316 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 7316 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 7317 Repeat in ubiquitin-activating (UBA) protein 7317 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 7317 Repeat in ubiquitin-activating (UBA) protein 7317 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 7318 Repeat in ubiquitin-activating (UBA) protein 7318 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 7328 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 7335 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 7335 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 7336 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 7343 HMG (high mobility group) box 7345 Ubiquitin carboxyl-terminal hydrolase, family 1 7347 Ubiquitin carboxyl-terminal hydrolase, family 1 7348 Tetraspanin family 7350 Mitochondrial carrier protein 7351 Mitochondrial carrier protein 7352 Mitochondrial carrier protein 7352 Mitochondrial carrier protein 7353 Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognized for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family inclu 7355 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 7356 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 7358 UDP-glucose/GDP-mannose dehydrogenase family, UDP binding domain. The UDP-glucose/GDP-mannose dehydrogenaseses are a small group of enzymes which possesses the ability to catalyse the NAD-dependent 2-fold oxidation of an alcohol to an acid without the re 7358 UDP-glucose/GDP-mannose dehydrogenase family, central domain. The UDP-glucose/GDP-mannose dehydrogenaseses are a small group of enzymes which possesses the ability to catalyse the NAD-dependent 2-fold oxidation of an alcohol to an acid without the releas 7360 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uri 7363 UDP-glucoronosyl and UDP-glucosyl transferase 7364 UDP-glucoronosyl and UDP-glucosyl transferase 7365 UDP-glucoronosyl and UDP-glucosyl transferase 7366 UDP-glucoronosyl and UDP-glucosyl transferase 7367 UDP-glucoronosyl and UDP-glucosyl transferase 7368 UDP-glucoronosyl and UDP-glucosyl transferase 7369 Zona pellucida-like domain 7372 Orotidine 5'-phosphate decarboxylase / HUMPS family. This family includes Orotidine 5'-phosphate decarboxylase enzymes EC:4.1.1.23 that are involved in the final step of pyrimidine biosynthesis. The family also includes enzymes such as hexulose-6-phospha 7373 Fibronectin type III domain 7373 von Willebrand factor type A domain 7376 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 7376 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7381 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly i 7388 Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formati 7389 Uroporphyrinogen decarboxylase (URO-D) 7391 Helix-loop-helix DNA-binding domain 7392 Helix-loop-helix DNA-binding domain 7399 Fibronectin type III domain 7399 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 7402 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 7403 jmjC domain 7404 jmjC domain 7404 jmjC domain 7407 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 7408 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 7409 PH domain. PH stands for pleckstrin homology 7409 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 7409 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 7409 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 7410 PH domain. PH stands for pleckstrin homology 7410 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 7410 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 7411 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 7414 Vinculin family 7414 Vinculin family 7415 Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o 7416 Eukaryotic porin 7417 Eukaryotic porin 7419 Eukaryotic porin 7421 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 7421 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 7428 von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA 7430 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 7430 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 7431 Intermediate filament proteins 7432 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 7433 7 transmembrane receptor (Secretin family) 7434 7 transmembrane receptor (Secretin family) 7436 Low-density lipoprotein receptor domain class A 7436 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 7439 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 7442 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 7442 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7442 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 7442 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7442 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 7442 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7442 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 7442 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 7443 Protein kinase domain 7444 Protein kinase domain 7448 Somatomedin B domain 7448 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 7450 von Willebrand factor type A domain 7450 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 7450 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 7453 WHEP-TRS domain 7454 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 7454 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 7455 von Willebrand factor type D domain 7455 MAM domain. An extracellular domain found in many receptors 7455 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr 7455 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 7455 von Willebrand factor type D domain 7455 MAM domain. An extracellular domain found in many receptors 7455 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr 7455 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 7455 von Willebrand factor type D domain 7455 MAM domain. An extracellular domain found in many receptors 7455 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr 7455 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 7455 von Willebrand factor type D domain 7455 MAM domain. An extracellular domain found in many receptors 7455 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr 7455 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 7455 von Willebrand factor type D domain 7455 MAM domain. An extracellular domain found in many receptors 7455 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr 7455 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 7455 von Willebrand factor type D domain 7455 MAM domain. An extracellular domain found in many receptors 7455 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr 7455 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 7461 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 7461 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 7465 Protein kinase domain 7468 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 7468 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 7468 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 7468 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 7468 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 7468 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 7468 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 7468 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 7468 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 7471 wnt family 7472 wnt family 7473 wnt family 7474 wnt family 7475 wnt family 7477 wnt family 7480 wnt family 7482 wnt family 7482 wnt family 7483 wnt family 7484 wnt family 7486 HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease 7486 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi 7490 Wilm's tumour protein 7490 Wilm's tumour protein 7490 Wilm's tumour protein 7490 Wilm's tumour protein 7507 XPA protein 7508 DNA repair protein Rad4 7515 XRCC1 N terminal domain 7515 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 7515 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 7525 SH2 domain 7525 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 7529 14-3-3 protein 7529 14-3-3 protein 7531 14-3-3 protein 7532 14-3-3 protein 7533 14-3-3 protein 7534 14-3-3 protein 7534 14-3-3 protein 7535 SH2 domain 7535 SH2 domain 7535 Protein kinase domain 7541 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 7551 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7551 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7554 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 7554 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7554 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 7561 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7565 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7566 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 7566 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7566 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 7586 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 7586 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7586 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 7587 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7597 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 7626 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7638 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7643 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 7673 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7694 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7695 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7703 Zinc finger, C3HC4 type (RING finger) 7704 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 7707 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 7709 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 7726 B-box zinc finger 7727 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 7733 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7743 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7745 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7752 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 7760 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 7760 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7760 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 7761 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7762 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 7763 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i 7781 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 7783 Zona pellucida-like domain 7784 Zona pellucida-like domain 7786 Protein kinase domain 7791 LIM domain. This family represents two copies of the LIM structural domain 7799 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 7799 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 7804 Low-density lipoprotein receptor domain class A 7804 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 7804 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 7804 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 7805 Golgi 4-transmembrane spanning transporter 7812 'Cold-shock' DNA-binding domain 7827 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 7832 BTG1 family. A novel family of anti-proliferative proteins 7837 Animal haem peroxidase 7837 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 7837 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 7837 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 7841 Mannosyl oligosaccharide glucosidase. This is a family of eukaryotic enzymes belonging to glycosyl hydrolase family 63. They catalyse the specific cleavage of the non-reducing terminal glucose residue from Glc(3)Man(9)GlcNAc(2). Mannosyl oligosaccharide 7849 'Paired box' domain 7849 'Paired box' domain 7849 'Paired box' domain 7849 'Paired box' domain 7849 'Paired box' domain 7852 7 transmembrane receptor (rhodopsin family) 7855 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 7857 Granin (chromogranin or secretogranin) 7862 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 7869 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 7879 Ras family. Includes sub-families Ras 7879 ADP-ribosylation factor family. Pfam combines a number of different Prosite families together 7903 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 7903 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 7904 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 7905 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 7917 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 7917 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 7917 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 7920 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 7923 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 7932 7 transmembrane receptor (rhodopsin family) 7941 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 7942 Helix-loop-helix DNA-binding domain 7957 Starch binding domain 7975 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 7976 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 7978 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 7982 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 7982 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 7988 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 7988 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 7993 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 7994 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 8000 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 8001 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 8001 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 8013 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 8013 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 8013 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 8013 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 8013 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 8013 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 8013 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 8022 Homeobox domain 8022 LIM domain. This family represents two copies of the LIM structural domain 8022 Homeobox domain 8022 LIM domain. This family represents two copies of the LIM structural domain 8027 VHS domain. Domain present in VPS-27, Hrs and STAM 8029 CUB domain 8029 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 8034 Mitochondrial carrier protein 8038 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8038 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8038 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8038 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8048 LIM domain. This family represents two copies of the LIM structural domain 8050 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 8065 Cullin family 8085 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 8085 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 8087 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 8089 YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity 8092 Homeobox domain 8111 7 transmembrane receptor (rhodopsin family) 8115 TCL1/MTCP1 family. Two related oncogenes, TCL-1 and MTCP-1, are overexpressed in T cell prolymphocytic leukemias as a result of chromosomal rearrangements that involve the translocation of one T cell receptor gene to either chromosome 14q32 or Xq28 8120 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 8128 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 8131 Uncharacterised protein family (UPF0171) 8139 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 8165 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 8170 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 8178 Occludin/ELL family 8193 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 8195 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 8195 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 8200 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 8216 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 8216 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 8218 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 8218 Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each le 8218 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 8218 Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each le 8224 N-terminal domain 8224 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 8224 N-terminal domain 8224 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 8224 N-terminal domain 8224 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 8228 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of 8233 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 8242 jmjC domain 8242 jmjN domain 8242 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 8242 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 8242 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 8260 Acetyltransferase (GNAT) family 8266 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 8273 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 8284 jmjC domain 8284 jmjN domain 8284 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 8284 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 8284 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 8288 Animal haem peroxidase 8289 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 8289 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 8289 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 8291 C2 domain 8301 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 8309 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 8310 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 8312 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 8312 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 8314 Ubiquitin carboxyl-terminal hydrolase, family 1 8315 Zn-finger in ubiquitin-hydrolases and other protein 8321 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 8322 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 8323 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 8324 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 8325 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 8326 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 8327 Sterile alpha motif (SAM)/Pointed domain 8337 Core histone H2A/H2B/H3/H4 8372 Hyaluronidase 8383 7 transmembrane receptor (rhodopsin family) 8385 7 transmembrane receptor (rhodopsin family) 8386 7 transmembrane receptor (rhodopsin family) 8387 7 transmembrane receptor (rhodopsin family) 8388 7 transmembrane receptor (rhodopsin family) 8390 7 transmembrane receptor (rhodopsin family) 8392 7 transmembrane receptor (rhodopsin family) 8395 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata 8398 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 8399 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 8402 Mitochondrial carrier protein 8402 Mitochondrial carrier protein 8403 HMG (high mobility group) box 8404 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 8405 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 8405 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 8406 HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc 8409 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 8409 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 8411 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 8416 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 8419 Intermediate filament protein 8424 Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1 8425 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 8427 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 8427 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 8431 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 8437 C2 domain 8439 Beige/BEACH domain 8440 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 8446 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 8447 C2 domain 8448 C2 domain 8450 Cullin family 8451 Cullin family 8452 Cullin family 8453 Cullin family 8454 Cullin family 8455 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 8455 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 8459 Sulfotransferase protein 8460 Sulfotransferase protein 8463 TEA/ATTS domain family 8464 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histon 8467 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 8470 Sorbin homologous domain 8470 Sorbin homologous domain 8471 PTB domain (IRS-1 type) 8476 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 8476 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 8476 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 8476 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 8477 7 transmembrane receptor (rhodopsin family) 8484 7 transmembrane receptor (rhodopsin family) 8491 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 8495 pfam02920, integrase_DNA, DNA binding domain of tn916 integrase 8495 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 8497 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 8503 SH2 domain 8506 Fibrinogen beta and gamma chains, C-terminal globular domain 8506 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 8507 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 8507 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 8509 Sulfotransferase protein 8510 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 8511 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 8513 ab-hydrolase associated lipase region 8515 von Willebrand factor type A domain 8516 Integrin alpha cytoplasmic region. This family contains the short intracellular region of integrin alpha chains 8516 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 8521 GCM motif protein 8526 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 8526 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 8527 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 8527 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 8528 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 8528 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 8529 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 8534 Sulfotransferase protein 8535 'chromo' (CHRromatin Organization MOdifier) domain 8538 Homeobox domain 8541 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 8543 LIM domain. This family represents two copies of the LIM structural domain 8546 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 8549 7 transmembrane receptor (rhodopsin family) 8553 Helix-loop-helix DNA-binding domain 8554 pfam02891, zf-MIZ, MIZ zinc finger 8554 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 8559 Prp18 domain. The splicing factor Prp18 is required for the second step of pre-mRNA splicing. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known. This fragment is fully active in yeast splicing in vitro and includes the sequ 8560 Fatty acid desaturase 8560 Fatty acid desaturase 8564 Monooxygenase. This family includes diverse enzymes that utilise FAD 8565 tRNA synthetases class I (W and Y) 8565 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte act 8567 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 8567 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 8567 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 8567 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 8567 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 8567 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 8567 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 8567 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 8567 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 8567 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 8567 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 8567 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 8567 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 8567 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 8567 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 8567 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 8567 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 8567 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 8567 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 8567 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 8567 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 8567 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 8567 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 8567 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 8572 LIM domain. This family represents two copies of the LIM structural domain 8573 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 8573 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 8575 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 8576 Protein kinase domain 8581 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 8590 7 transmembrane receptor (rhodopsin family) 8601 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 8602 Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production 8603 Beta type Zein 8604 Mitochondrial carrier protein 8605 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lyso 8608 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 8611 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 8611 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 8612 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 8612 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 8612 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 8613 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 8613 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 8614 Stanniocalcin family 8622 3'5'-cyclic nucleotide phosphodiesterase 8623 O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine 8626 P53 8629 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 8629 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 8630 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 8632 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 8635 Ribonuclease T2 family 8638 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 8639 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 8639 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 8639 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 8642 Cadherin domain 8643 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 8644 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 8645 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 8646 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 8646 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 8650 Phosphotyrosine interaction domain (PTB/PID) 8654 3'5'-cyclic nucleotide phosphodiesterase 8654 3'5'-cyclic nucleotide phosphodiesterase 8654 3'5'-cyclic nucleotide phosphodiesterase 8654 3'5'-cyclic nucleotide phosphodiesterase 8655 Dynein light chain type 1 8658 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 8659 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 8659 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 8660 PTB domain (IRS-1 type) 8660 PTB domain (IRS-1 type) 8660 PH domain. PH stands for pleckstrin homology 8662 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 8662 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 8665 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 8671 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 8672 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 8673 Synaptobrevin 8674 Synaptobrevin 8678 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either asso 8681 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lyso 8682 Death effector domain 8683 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 8687 Intermediate filament protein 8688 Intermediate filament protein 8689 Intermediate filament protein 8690 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 8692 Hyaluronidase 8692 Hyaluronidase 8693 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 8697 Anaphase promoting complex subunit 8 / cdc23. The anaphase-promoting complex is composed of eight protein subunits, including BimE (APC1), CDC27 (APC3), CDC16 (APC6), and CDC23 (APC8) 8698 7 transmembrane receptor (rhodopsin family) 8701 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 8702 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 8703 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 8704 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 8705 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 8705 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 8705 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 8706 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 8706 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 8706 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 8706 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 8707 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 8708 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 8710 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 8714 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 8718 Death domain 8718 Death domain 8718 Death domain 8718 Death domain 8718 Death domain 8720 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 8722 Papain family cysteine protease 8722 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 8724 PX domain. PX domains bind to phosphoinositides 8725 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 8725 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 8727 Vinculin family 8728 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8728 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8728 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8728 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8729 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 8732 mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme 8732 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 8733 Gaa1-like, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to newly 8735 Myosin head (motor domain) 8737 Death domain 8741 TNF(Tumor Necrosis Factor) family 8741 TNF(Tumor Necrosis Factor) family 8741 TNF(Tumor Necrosis Factor) family 8744 TNF(Tumor Necrosis Factor) family 8745 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8745 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8747 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8747 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8748 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8748 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8749 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8749 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8751 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8751 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8754 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 8754 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 8756 Disintegrin 8756 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 8756 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 8760 Phosphatidate cytidylyltransferase 8761 Poly-adenylate binding protein, unique domain 8767 Protein kinase domain 8767 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 8772 Death domain 8772 Death effector domain 8775 NSF attachment protein 8785 von Willebrand factor type A domain 8785 von Willebrand factor type A domain 8785 von Willebrand factor type A domain 8786 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 8788 Uncharacterized protein family (ORF7) DUF. Several members of this family are Borrelia burgdorferi plasmid proteins of uncharacterized function 8789 Fructose-1-6-bisphosphatase 8795 TNFR/NGFR cysteine-rich region 8795 TNFR/NGFR cysteine-rich region 8797 TNFR/NGFR cysteine-rich region 8805 B-box zinc finger 8805 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 8807 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 8808 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 8809 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 8811 7 transmembrane receptor (rhodopsin family) 8812 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 8813 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 8815 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 8816 WD domain, G-beta repeat 8817 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 8817 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 8820 Homeobox domain 8822 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 8823 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 8824 Carboxylesterase 8825 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 8828 CUB domain 8828 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 8828 CUB domain 8828 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 8829 CUB domain 8829 MAM domain. An extracellular domain found in many receptors 8829 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 8833 GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. 8837 Death effector domain 8840 Thrombospondin type 1 domain 8840 Thrombospondin type 1 domain 8841 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 8843 7 transmembrane receptor (rhodopsin family) 8844 Protein kinase domain 8848 TSC-22/dip/bun family 8851 Cyclin-dependent kinase 5 activator protein 8853 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 8854 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 8854 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 8854 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 8856 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 8856 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 8856 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 8856 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 8856 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 8856 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 8857 von Willebrand factor type D domain 8857 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 8858 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 8858 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 8861 LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located i 8867 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 8869 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 8871 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 8876 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 8882 ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with othe 8883 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 8888 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 8890 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 8892 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 8893 Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars 8893 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 8894 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 8896 G10 protein 8904 C2 domain 8904 C2 domain 8904 C2 domain 8904 C2 domain 8904 C2 domain 8904 C2 domain 8904 C2 domain 8904 C2 domain 8905 Clathrin adaptor complex small chain 8906 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 8906 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 8906 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 8906 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 8907 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 8911 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 8912 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 8913 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 8913 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 8915 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 8924 Anaphase-promoting complex, subunit 10 (APC10) 8924 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 8926 Sm protein 8929 Homeobox domain 8930 HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by 8932 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha 8936 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 8938 C2 domain 8939 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 8941 Cyclin-dependent kinase 5 activator protein 8943 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 8971 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 8972 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 8973 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 8973 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 8976 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 8976 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 8976 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 8987 Starch binding domain 8988 Hsp20/alpha crystallin family 8989 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 8997 PH domain. PH stands for pleckstrin homology 9002 7 transmembrane receptor (rhodopsin family) 9016 Mitochondrial carrier protein 9016 Mitochondrial carrier protein 9025 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 9026 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 9026 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 9026 I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin 9026 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 9027 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 9028 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 9031 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 9031 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 9033 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 9034 7 transmembrane receptor (rhodopsin family) 9037 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 9037 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 9038 7 transmembrane receptor (rhodopsin family) 9039 Repeat in ubiquitin-activating (UBA) protein 9039 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 9040 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 9043 Intermediate filament protein 9045 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 9048 Transforming growth factor beta like domain 9048 Transforming growth factor beta like domain 9048 Transforming growth factor beta like domain 9048 Transforming growth factor beta like domain 9050 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 9052 7 transmembrane receptor (metabotropic glutamate family) 9054 Aminotransferase class-V 9054 Aminotransferase class-V 9055 Microtubule associated protein (MAP65/ASE1 family) 9058 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 9060 ATP-sulfurylase 9060 Adenylylsulfate kinase 9061 ATP-sulfurylase 9061 Adenylylsulfate kinase 9063 pfam02891, zf-MIZ, MIZ zinc finger 9063 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 9063 pfam02891, zf-MIZ, MIZ zinc finger 9063 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 9071 PMP-22/EMP/MP20/Claudin family 9071 PMP-22/EMP/MP20/Claudin family 9073 PMP-22/EMP/MP20/Claudin family 9074 PMP-22/EMP/MP20/Claudin family 9075 PMP-22/EMP/MP20/Claudin family 9076 PMP-22/EMP/MP20/Claudin family 9079 LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located i 9080 PMP-22/EMP/MP20/Claudin family 9086 Eukaryotic initiation factor 1A 9092 SART-1 family. This family of proteins appear to contain a leucine zipper and may therefore be a family of transcription factors 9095 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 9096 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 9097 Ubiquitin carboxyl-terminal hydrolase family 2 9101 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 9104 Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis 9104 Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis 9112 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 9113 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 9114 ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae 9118 Intermediate filament protein 9119 Intermediate filament protein 9122 Sugar (and other) transporter 9123 Monocarboxylate transporter 9124 LIM domain. This family represents two copies of the LIM structural domain 9124 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 9125 Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes 9127 ATP P2X receptor 9129 PWI domain 9131 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 9131 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 9131 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 9132 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 9132 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 9132 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 9132 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 9134 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 9134 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 9134 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 9135 Rabaptin 9139 MYND finger 9139 MYND finger 9140 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 9141 Protein of unknown function DUF122. This protein family has no known function 9143 Synaptogyrin. This family of proteins is distantly related to pfam01284 9144 Synaptogyrin. This family of proteins is distantly related to pfam01284 9145 Synaptogyrin. This family of proteins is distantly related to pfam01284 9145 Synaptogyrin. This family of proteins is distantly related to pfam01284 9145 Synaptogyrin. This family of proteins is distantly related to pfam01284 9146 VHS domain. Domain present in VPS-27, Hrs and STAM 9150 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 9150 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 9150 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 9150 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 9152 Sodium:neurotransmitter symporter family 9153 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin 9154 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin 9159 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 9167 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 9170 7 transmembrane receptor (rhodopsin family) 9172 Fibronectin type III domain 9173 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 9175 Protein kinase domain 9177 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 9177 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 9179 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 9180 Fibronectin type III domain 9187 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 9189 pfam02892, zf-BED, BED zinc finger 9211 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 9211 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 9213 EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be 9213 SPX domain. We have named this region the SPX domain after (SYG1, Pho81 and XPR1). This 180 residue length domain is found at the amino terminus of a variety of proteins. In the yeast protein SYG1, the N-terminus directly binds to the G- protein beta sub 9215 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 9215 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 9217 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 9218 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 9219 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 9223 Guanylate kinase 9223 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 9223 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 9228 Guanylate-kinase-associated protein (GKAP) protein 9229 Guanylate-kinase-associated protein (GKAP) protein 9236 Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown 9236 Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown 9245 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 9246 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 9247 GCM motif protein 9248 7 transmembrane receptor (rhodopsin family) 9248 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 9253 Phosphotyrosine interaction domain (PTB/PID) 9254 Cache domain 9255 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte act 9260 LIM domain. This family represents two copies of the LIM structural domain 9265 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 9266 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 9266 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 9267 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 9267 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 9271 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 9271 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 9271 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 9271 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 9276 Coatomer WD associated domain 9278 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 9283 7 transmembrane receptor (rhodopsin family) 9284 Caldesmon 9287 7 transmembrane receptor (rhodopsin family) 9289 7 transmembrane receptor (Secretin family) 9289 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 9290 7 transmembrane receptor (rhodopsin family) 9293 7 transmembrane receptor (rhodopsin family) 9294 7 transmembrane receptor (rhodopsin family) 9296 ATP synthase (F/14-kDa) subunit. This family includes 14-kDa subunit from vATPases, which is in the peripheral catalytic part of the complex. The family also includes archaebacterial ATP synthase subunit F 9311 Amiloride-sensitive sodium channel 9311 Amiloride-sensitive sodium channel 9311 Amiloride-sensitive sodium channel 9312 Kv2 voltage-gated K+ channel 9312 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 9312 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 9313 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 9313 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 9317 Phosphotriesterase family 9320 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 9322 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 9331 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 9333 Transglutaminase family 9333 Transglutaminase family, C-terminal ig like domain 9333 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 9334 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 9340 7 transmembrane receptor (Secretin family) 9340 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 9341 Synaptobrevin 9348 Sulfotransferase protein 9350 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 9352 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 9353 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 9355 Homeobox domain 9355 LIM domain. This family represents two copies of the LIM structural domain 9356 Sugar (and other) transporter 9356 Sugar (and other) transporter 9356 Sugar (and other) transporter 9356 Sugar (and other) transporter 9356 Sugar (and other) transporter 9360 Cyclophilin type peptidyl-prolyl cis-trans isomerase 9362 C2 domain 9365 Glycosyl hydrolase family 1 9365 Glycosyl hydrolase family 1 9367 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 9370 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 9371 Kinesin motor domain 9372 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 9372 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 9372 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 9374 Palmitoyl protein thioesterase 9374 Palmitoyl protein thioesterase 9374 Palmitoyl protein thioesterase 9375 Endomembrane protein 70 9376 Sugar (and other) transporter 9377 Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va 9378 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 9381 C2 domain 9388 Lipase 9388 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 9389 Sugar (and other) transporter 9390 Sugar (and other) transporter 9399 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 9399 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 9404 LIM domain. This family represents two copies of the LIM structural domain 9407 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 9414 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 9415 Fatty acid desaturase 9415 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 9420 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 9423 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 9424 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 9435 Sulfotransferase protein 9445 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 9447 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 9447 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 9448 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 9450 ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r 9452 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 9453 Polyprenyl synthetase 9454 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 9455 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 9456 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 9462 C2 domain 9464 Helix-loop-helix DNA-binding domain 9470 Eukaryotic initiation factor 4E 9472 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 9474 Autophagy protein Apg5. Apg5p is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway 9476 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 9480 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 9481 Mitochondrial carrier protein 9486 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 9488 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 9493 Kinesin motor domain 9493 Kinesin motor domain 9495 Protein kinase A anchor 9497 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 9498 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 9500 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 9501 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 9507 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 9508 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 9508 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 9509 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 9509 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 9509 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 9509 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 9510 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 9510 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 9520 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 9522 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 9522 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 9524 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 9524 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 9525 MIT domain 9526 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How 9529 BAG domain. Domain present in Hsp70 regulators 9530 BAG domain. Domain present in Hsp70 regulators 9531 BAG domain. Domain present in Hsp70 regulators 9532 BAG domain. Domain present in Hsp70 regulators 9535 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 9536 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 9542 Neuregulin family 9542 Neuregulin family 9542 Neuregulin family 9542 Neuregulin family 9543 Fibronectin type III domain 9543 Fibronectin type III domain 9543 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 9546 Phosphotyrosine interaction domain (PTB/PID) 9546 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 9547 Small cytokines (intecrine/chemokine) 9550 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 9559 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp 9562 Histidine acid phosphatase 9568 7 transmembrane receptor (metabotropic glutamate family) 9568 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 9569 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 9569 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 9572 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 9572 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 9573 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 9578 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 9578 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 9581 Prolyl oligopeptidase family 9581 pfam02897, Peptidase_S9_N, Prolyl oligopeptidase, N-terminal beta-propeller domain. This unusual 7-stranded beta-propeller domain protects the catalytic triad of prolyl oligopeptidase (see pfam00326), excluding larger peptides and proteins from proteolysi 9582 Cytidine and deoxycytidylate deaminase zinc-binding region 9583 GDA1/CD39 (nucleoside phosphatase) family 9585 Kinesin motor domain 9600 Phosphatidylinositol transfer protein 9600 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester 9601 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 9604 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 9610 Vacuolar sorting protein 9 (VPS9) domain 9610 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 9611 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 9612 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 9618 TRAF-type zinc finger 9618 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 9618 TRAF-type zinc finger 9618 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 9620 Cadherin domain 9620 7 transmembrane receptor (Secretin family) 9620 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 9620 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 9625 Protein kinase domain 9628 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 9630 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 9631 Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and protein 9631 Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and protein 9633 Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in 9636 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 9644 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 9648 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 9648 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 9649 PH domain. PH stands for pleckstrin homology 9654 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 9656 Lysosome-associated membrane glycoprotein (Lamp) 9656 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 9656 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 9658 Zinc finger, C2H2 type 9667 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 9671 Sulfotransferase protein 9671 WSC domain. This domain may be involved in carbohydrate binding 9672 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 9673 Mitochondrial carrier protein 9677 Histidine acid phosphatase 9678 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 9681 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 9682 jmjC domain 9682 jmjN domain 9685 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 9688 Nucleoporin interacting componentent 9695 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 9697 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 9700 Peptidase family C50 9702 Arsenical pump membrane protein 9705 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 9709 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 9711 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 9714 PH domain. PH stands for pleckstrin homology 9717 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 9717 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 9722 Phosphotyrosine interaction domain (PTB/PID) 9722 Phosphotyrosine interaction domain (PTB/PID) 9723 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 9725 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 9728 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 9739 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 9739 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 9741 Golgi 4-transmembrane spanning transporter 9744 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 9745 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 9749 RPEL repeat 9757 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 9757 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 9757 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 9758 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 9759 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 9765 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 9770 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 9770 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 9770 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 9777 Endomembrane protein 70 9781 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 9784 PX domain. PX domains bind to phosphoinositides 9786 Herpesvirus UL6 like 9787 Guanylate-kinase-associated protein (GKAP) protein 9791 Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phos 9795 PX domain. PX domains bind to phosphoinositides 9796 Fibronectin type III domain 9797 TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme 9798 Eukaryotic protein of unknown function, DUF292 9801 Ribosomal protein L19 9804 MAS20 protein import receptor 9806 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 9807 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 9817 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 9819 TSC-22/dip/bun family 9820 Cullin family 9820 Anaphase-promoting complex, subunit 10 (APC10) 9833 Kinase associated domain 1 9836 Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence 9837 Uncharacterised protein family (UPF0080) 9839 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 9841 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 9842 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 9843 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 9843 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 9847 C2 domain 9853 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 9853 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 9855 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 9855 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 9857 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 9859 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 9863 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 9865 Fibronectin type III domain 9865 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 9866 B-box zinc finger 9869 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 9870 Filamin/ABP280 repeat 9878 HMG (high mobility group) box 9880 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 9885 Oxysterol-binding protein 9885 Oxysterol-binding protein 9886 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 9889 pfam02892, zf-BED, BED zinc finger 9890 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 9890 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 9892 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 9896 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 9899 Sugar (and other) transporter 9900 Sugar (and other) transporter 9901 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 9902 Lectin C-type domain. This family includes both long and short form C-type 9903 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 9903 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 9905 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 9905 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 9909 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 9909 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 9915 Helix-loop-helix DNA-binding domain 9920 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 9920 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 9924 Ubiquitin carboxyl-terminal hydrolase family 2 9925 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 9927 GTPase of unknown function 9928 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 9928 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 9929 Josephin 9931 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 9934 7 transmembrane receptor (rhodopsin family) 9935 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 9938 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 9947 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 9949 Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function 9951 Sulfotransferase protein 9953 Sulfotransferase protein 9955 Sulfotransferase protein 9956 Sulfotransferase protein 9957 Sulfotransferase protein 9961 Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein 9961 Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein 9962 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 9963 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 9963 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 9970 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 9970 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 9971 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 9971 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 9972 Zn-finger in Ran binding protein and others 9973 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 9975 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 9975 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 9980 Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in 9986 CAAX amino terminal protease family. Members of this family are probably proteases. the family contains CAAX prenyl protease. The proteins contain a highly conserved Glu-Glu motif at the amino end of the alignment. The alignment also contains two histidi 9987 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 9988 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 9989 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 9990 K-Cl Co-transporter type 1 (KCC1) 9991 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 9992 Slow voltage-gated potassium channel 9996 Sec63 domain 9996 pfam02889, Sec63, Sec63 domain 9996 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 9996 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 10002 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 10002 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 10002 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 10002 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 10003 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 10004 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 10004 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 10007 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 10008 Slow voltage-gated potassium channel 10009 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 10010 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 10013 Zn-finger in ubiquitin-hydrolases and other protein 10015 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 10021 Cyclic nucleotide-binding domain 10021 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 10026 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this 10038 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 10038 pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini 10039 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 10039 pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini 10040 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 10043 VHS domain. Domain present in VPS-27, Hrs and STAM 10043 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 10052 Connexin 10053 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 10054 Repeat in ubiquitin-activating (UBA) protein 10054 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 10055 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 10057 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 10059 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 10059 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 10059 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 10062 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 10062 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 10066 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 10067 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 10067 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 10071 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 10072 Peptidase family M49 10072 Peptidase family M49 10075 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 10075 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 10076 Fibronectin type III domain 10076 Fibronectin type III domain 10076 Fibronectin type III domain 10077 Tetraspanin family 10077 Tetraspanin family 10077 Tetraspanin family 10082 Glypican 10085 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 10085 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 10086 1 10094 P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function 10098 Tetraspanin family 10099 Tetraspanin family 10100 Tetraspanin family 10102 Elongation factor TS 10103 Tetraspanin family 10105 Cyclophilin type peptidyl-prolyl cis-trans isomerase 10106 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 10107 B-box zinc finger 10107 B-box zinc finger 10109 Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p 10109 Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p 10112 Kinesin motor domain 10123 ADP-ribosylation factor family 10124 ADP-ribosylation factor family 10125 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 10126 Dynein light chain type 1 10130 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 10139 ADP-ribosylation factor family 10140 BTG1 family. A novel family of anti-proliferative proteins 10143 Lectin C-type domain. This family includes both long and short form C-type 10147 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 10147 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 10149 7 transmembrane receptor (Secretin family) 10153 CBF/Mak21 family 10154 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 10154 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 10155 B-box zinc finger 10155 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 10157 Alanine dehydrogenase/pyridine nucleotide transhydrogenase. This family now also contains the lysine 2-oxoglutarate reductases 10157 Saccharopine dehydrogenase. This family comprised of three structural domains that can not be separated in the linear sequence. In some organisms this enzyme is found as a bifunctional polypeptide with lysine ketoglutarate reductase (PF). The saccharopin 10160 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 10161 7 transmembrane receptor (rhodopsin family) 10165 Mitochondrial carrier protein 10166 Mitochondrial carrier protein 10168 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 10170 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 10171 RNA 3'-terminal phosphate cyclase 10175 Cornichon protein 10190 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 10197 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 10197 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 10197 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 10197 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 10199 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerev 10199 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi 10201 Nucleoside diphosphate kinases 10202 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 10203 7 transmembrane receptor (Secretin family) 10206 B-box zinc finger 10206 B-box zinc finger 10209 Translation initiation factor SUI1 10211 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 10215 Helix-loop-helix DNA-binding domain 10216 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 10217 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 10220 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 10224 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 10226 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 10235 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 10235 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ 10235 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 10239 Clathrin adaptor complex small chain 10242 Calcium-activated potassium channel, beta subunit 10242 Calcium-activated potassium channel, beta subunit 10245 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim 10247 Endoribonuclease L-PSP. Endoribonuclease active on single-stranded mRNA. Inhibits protein synthesis by cleavage of mRNA. Previously thought to inhibit protein synthesis initiation. This protein may also be involved in the regulation of purine biosynthesi 10250 PWI domain 10254 VHS domain. Domain present in VPS-27, Hrs and STAM 10260 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 10260 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 10260 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 10269 Peptidase family M48 10274 Stromal antigen (SA/STAG) protein 10279 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 10283 Cyclophilin type peptidyl-prolyl cis-trans isomerase 10287 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 10289 Translation initiation factor SUI1 10307 Phosphotyrosine interaction domain (PTB/PID) 10307 Phosphotyrosine interaction domain (PTB/PID) 10307 Phosphotyrosine interaction domain (PTB/PID) 10307 Phosphotyrosine interaction domain (PTB/PID) 10307 Phosphotyrosine interaction domain (PTB/PID) 10307 Phosphotyrosine interaction domain (PTB/PID) 10309 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 10311 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp 10312 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 10312 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 10313 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 10316 7 transmembrane receptor (rhodopsin family) 10317 Galactosyltransferase 10317 Galactosyltransferase 10317 Galactosyltransferase 10317 Galactosyltransferase 10317 Galactosyltransferase 10319 Laminin B (Domain IV) 10319 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 10322 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 10324 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 10324 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 10325 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 10325 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 10327 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 10327 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 10328 Uncharacterised protein family (UPF0172) 10331 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 10332 Lectin C-type domain. This family includes both long and short form C-type 10333 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 10343 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 10343 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 10343 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c 10346 B-box zinc finger 10346 Zinc finger, C3HC4 type (RING finger) 10351 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 10360 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 10361 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 10361 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 10362 HMG (high mobility group) box 10363 HMG (high mobility group) box 10368 PMP-22/EMP/MP20/Claudin family 10369 PMP-22/EMP/MP20/Claudin family 10371 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 10376 Tubulin/FtsZ family. This family includes the tubulin alpha 10379 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 10380 Inositol monophosphatase family 10380 Inositol monophosphatase family 10385 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 10385 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 10389 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 10392 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 10392 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 10393 Anaphase-promoting complex, subunit 10 (APC10) 10397 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 10400 Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids 10400 Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids 10400 Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids 10401 pfam02891, zf-MIZ, MIZ zinc finger 10402 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 10403 HEC/Ndc80p family. Members of this family are components of the mitotic spindle. It has been shown that Ndc80/HEC from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle 10404 Peptidase family M28D 10406 WAP-type (Whey Acidic Protein) 'four-disulfide core' 10411 Cyclic nucleotide-binding domain 10411 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 10418 Reeler domain 10418 Thrombospondin type 1 domain 10420 Protein kinase domain 10423 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 10423 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 10424 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 10425 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 10426 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 10434 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 10436 Suppressor Mra1. The suppressor Mra1 is found in high-copy-number when Ras1 is mutated, that recovers the mating deficiency caused by the decrease of Ras1 activity. Mutational analysis in yeast suggests that the suppressor Mra1 is essential for cell grow 10437 Gamma interferon inducible lysosomal thiol reductase (GILT). This family includes the two characterized human gamma-interferon-inducible lysosomal thiol reductase (GILT) sequences. It also contains several other eukaryotic putative proteins with similari 10440 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim 10445 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 10449 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 10450 Cyclophilin type peptidyl-prolyl cis-trans isomerase 10451 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 10451 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 10454 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 10455 Acyl CoA binding protein 10459 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 10461 Fibronectin type III domain 10462 Hepatic lectin, N-terminal domain 10462 Lectin C-type domain. This family includes both long and short form C-type 10463 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 10465 Cyclophilin type peptidyl-prolyl cis-trans isomerase 10469 Mitochondrial import inner membrane, translocase subunit TIM44. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids o 10471 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 10472 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 10474 Putative replicase 1 (ORF2) 10475 B-box zinc finger 10478 Mitochondrial carrier protein 10479 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 10481 Homeobox domain 10482 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 10482 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl 10483 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2 10483 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2 10483 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2 10484 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2 10486 CAP protein 10487 CAP protein 10497 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 10507 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 10509 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 10509 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 10512 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 10513 Cyclin 10522 MYND finger 10522 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 10523 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 10525 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 10529 Nebulin repeat 10533 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 10535 Ribonuclease HII 10537 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 10539 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 10544 Class I Histocompatibility antigen, domains alpha 1 and 2 10548 Endomembrane protein 70 10550 Prenylated rab acceptor (PRA1) 10554 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 10554 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 10555 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 10556 RNase P subunit p30. This protein is part of the RNase P complex that is involved in tRNA maturation 10557 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa 10559 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 10560 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog 10567 Prenylated rab acceptor (PRA1) 10568 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 10570 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 10571 Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 10575 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 10577 ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r 10580 Sorbin homologous domain 10580 Sorbin homologous domain 10586 Mab-21 protein 10588 5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF 10599 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 10599 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 10605 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 10605 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 10606 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 10606 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 10610 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 10611 LIM domain. This family represents two copies of the LIM structural domain 10613 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 10615 Myosin tail 10617 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 10620 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 10625 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 10631 Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria 10636 Raf-like Ras-binding domain 10637 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 10641 Uncharacterised protein family (UPF0171) 10654 Phosphomevalonate kinase. Phosphomevalonate kinase (EC:2.7.4.2) catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This family represents the animal ty 10660 Homeobox domain 10663 7 transmembrane receptor (rhodopsin family) 10667 tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain 10672 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 10678 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 10678 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 10686 PMP-22/EMP/MP20/Claudin family 10687 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 10690 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 10691 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 10691 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 10692 7 transmembrane receptor (rhodopsin family) 10693 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 10694 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 10699 Low-density lipoprotein receptor domain class A 10699 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 10715 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 10717 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 10720 UDP-glucoronosyl and UDP-glucosyl transferase 10726 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 10734 Stromal antigen (SA/STAG) protein 10735 Stromal antigen (SA/STAG) protein 10747 Trypsin 10747 CUB domain 10747 Sushi domain (SCR repeat) 10749 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 10750 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 10752 Fibronectin type III domain 10753 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 10761 Zona pellucida-like domain 10765 jmjC domain 10765 jmjN domain 10765 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 10765 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 10765 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 10766 BTG1 family. A novel family of anti-proliferative proteins 10768 S-adenosyl-L-homocysteine hydrolase 10769 POLO box duplicated region 10771 MYND finger 10771 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 10772 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 10773 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 10782 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 10782 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 10782 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 10791 Synaptobrevin 10794 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 10794 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 10797 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 10798 7 transmembrane receptor (rhodopsin family) 10800 7 transmembrane receptor (rhodopsin family) 10801 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 10802 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 10803 7 transmembrane receptor (rhodopsin family) 10803 7 transmembrane receptor (rhodopsin family) 10804 Connexin 10807 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 10808 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 10810 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 10817 PTB domain (IRS-1 type) 10818 PTB domain (IRS-1 type) 10826 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. 10838 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 10840 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 10840 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 10844 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 10846 3'5'-cyclic nucleotide phosphodiesterase 10847 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 10855 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular 10857 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 10858 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 10861 Sulfate transporter family. Mutations may lead to several human diseases 10863 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 10863 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 10863 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 10863 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 10863 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 10863 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 10864 Sugar (and other) transporter 10865 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 10867 Tetraspanin family 10869 MYND finger 10869 Ubiquitin carboxyl-terminal hydrolase family 2 10869 Ubiquitin carboxyl-terminal hydrolases family 2 10873 Malic enzyme, NAD binding domain 10874 Neuromedin U 10875 Fibrinogen beta and gamma chains, C-terminal globular domain 10877 Sushi domain (SCR repeat) 10878 Sushi domain (SCR repeat) 10880 Actin 10881 Actin 10885 WD domain, G-beta repeat 10885 Dip2/Utp12 Family. This domain is found at the C-terminus of proteins containing WD40 repeats. These proteins are part of the U3 ribonucleoprotein the yeast protein is called Utp12 or DIP2 10886 7 transmembrane receptor (rhodopsin family) 10886 7 transmembrane receptor (rhodopsin family) 10887 7 transmembrane receptor (rhodopsin family) 10888 7 transmembrane receptor (rhodopsin family) 10890 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 10892 Death domain 10892 Death domain 10893 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 10893 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 10893 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 10897 Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1 10900 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 10901 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 10905 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 10906 TRAF-type zinc finger 10906 7 transmembrane receptor (rhodopsin family) 10907 Mitosis protein DIM1 10908 Cyclic nucleotide-binding domain 10911 Urotensin II 10912 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 10913 Death domain 10915 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 10916 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 10916 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 10919 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 10919 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 10923 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p1 10927 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members 10936 7 transmembrane receptor (rhodopsin family) 10941 UDP-glucoronosyl and UDP-glucosyl transferase 10945 ER lumen protein retaining receptor 10947 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 10950 BTG1 family. A novel family of anti-proliferative proteins 10951 'chromo' (CHRromatin Organization MOdifier) domain 10951 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 10952 Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG 10954 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 10955 TMS membrane protein/tumour differentially expressed protein (TDE) 10959 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 10960 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19- 10961 Transposase 10963 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 10969 Cobalt transport protein 10971 14-3-3 protein 10972 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 10983 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 10988 metallopeptidase family M24 10991 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 10992 PSP. Proline rich domain found in numerous spliceosome associated proteins 10992 Domain of unknown function (DUF382). This domain is specific to the human splicing factor 3b subunit 2 and it's orthologs 10992 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 11004 Kinesin motor domain 11012 Trypsin 11014 ER lumen protein retaining receptor 11015 ER lumen protein retaining receptor 11015 ER lumen protein retaining receptor 11018 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 11020 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 11022 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 11022 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 11031 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 11033 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 11034 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 11035 Protein kinase domain 11036 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep 11036 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep 11037 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 11039 Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 11043 B-box zinc finger 11043 B-box zinc finger 11045 Tetraspanin family 11051 MutT-like domain 11059 C2 domain 11060 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 11061 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 11062 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou 11065 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 11069 Cyclic nucleotide-binding domain 11069 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 11073 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 11075 Stathmin family 11079 Rer1 family. RER1 family protein are involved in involved in the retrieval of some endoplasmic reticulum membrane proteins from the early golgi compartment. The C terminus of yeast Rer1p interacts with a coatomer complex 11083 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 11083 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 11083 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 11085 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11085 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11086 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11086 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11086 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11086 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11086 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11086 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11093 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11093 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11093 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11093 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11095 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11095 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11096 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11096 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11102 Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40 11113 PH domain. PH stands for pleckstrin homology 11113 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 11113 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 11113 Rotavirus non-structural protein NSP3. This family consist of rotaviral non-structural RNA binding protein 34 (NS34 or NSP3). The NSP3 protein has been shown to bind viral RNA. The NSP3 protein consists of 3 conserved functional domains 11122 Fibronectin type III domain 11122 Fibronectin type III domain 11127 Kinesin motor domain 11128 RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family 11128 RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu 11138 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 11139 PH domain. PH stands for pleckstrin homology 11140 Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90 11141 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 11142 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 11142 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 11142 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 11143 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 11145 NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins 11153 Fic protein family. This family consists of the Fic (filamentation induced by cAMP) protein and its relatives. The Fic protein is involved in cell division and is suggested to be involved in the synthesis of PAB or folate, indicating that the Fic protein 11155 LIM domain. This family represents two copies of the LIM structural domain 11155 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 11157 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 11160 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 11161 Uncharacterised protein family (UPF0143). This family of uncharacterised proteins are integral membrane proteins. They may contain 4 transmembrane helices. The family contains a conserved arginine and histidine that may be functionally important 11162 NUDIX domain 11163 NUDIX domain 11164 NUDIX domain 11165 NUDIX domain 11166 HMG (high mobility group) box 11168 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 11168 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 11173 Thrombospondin type 1 domain 11173 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11173 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11174 Thrombospondin type 1 domain 11174 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11174 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11176 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 11176 DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors) 11177 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 11177 DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors) 11177 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 11177 DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors) 11181 Trehalase 11182 Sugar (and other) transporter 11185 NNMT/PNMT/TEMT family 11186 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 11186 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 11186 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 11186 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 11186 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 11186 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 11186 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 11186 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 11186 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 11186 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 11188 PX domain. PX domains bind to phosphoinositides 11193 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 11196 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester 11199 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11200 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 11201 impB/mucB/samB family. These proteins are involved in UV protection 11211 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 11213 Death domain 11213 Protein kinase domain 11221 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 11221 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 11221 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 11222 Ribosomal protein L3 11224 Sodium:alanine symporter family 11226 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 11227 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 11227 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 11238 Eukaryotic-type carbonic anhydrase 11240 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 11245 7 transmembrane receptor (rhodopsin family) 11250 7 transmembrane receptor (rhodopsin family) 11251 7 transmembrane receptor (rhodopsin family) 11252 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 11253 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 11253 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 11254 Sodium:neurotransmitter symporter family 11255 7 transmembrane receptor (rhodopsin family) 11260 Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore comp 11262 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 11262 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 11262 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 11267 EAP30. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of t 11275 Kelch motif 11275 BTB/POZ domain 11280 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 11281 Homeobox domain 11283 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 11285 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 11287 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 11287 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 11309 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 11309 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 11311 Sec1 family 11313 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 11314 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 11315 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 11317 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 11318 7 transmembrane receptor (rhodopsin family) 11321 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 11328 FKBP-type peptidyl-prolyl cis-trans isomerase 11331 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 11332 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 11332 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 11332 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 11332 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 11332 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 11332 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 11335 'chromo' (CHRromatin Organization MOdifier) domain 11335 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 11335 'chromo' (CHRromatin Organization MOdifier) domain 11335 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 11337 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 11340 3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 11340 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 11345 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 11350 Protein kinase domain 11352 SH2 domain 11352 Protein kinase domain 11352 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 11354 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 11418 Amiloride-sensitive sodium channel 11419 Amiloride-sensitive sodium channel 11421 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 11421 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 11423 Carboxylesterase 11423 Carboxylesterase 11426 Growth-Arrest-Specific Protein 2 Domain 11426 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 11426 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 11428 Aconitase family (aconitate hydratase) 11428 Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism 11430 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 11432 Histidine acid phosphatase 11432 Histidine acid phosphatase 11433 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 11435 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11435 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11438 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11438 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11440 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11440 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11441 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11441 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11443 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11443 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11444 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11444 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11447 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11447 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11448 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11448 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11449 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 11449 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 11450 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 11459 Actin 11459 Actin 11461 Actin 11464 Actin 11465 Actin 11465 Actin 11468 Actin 11469 Actin 11470 Actin 11471 Actin 11472 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 11474 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 11475 Actin 11477 Protein kinase domain 11477 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 11479 Protein kinase domain 11479 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 11479 Protein kinase domain 11479 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t 11480 Protein kinase domain 11480 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 11481 Protein kinase domain 11481 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 11482 Protein kinase domain 11482 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 11486 Adenosine/AMP deaminase 11487 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11488 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11488 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11489 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11489 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11490 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11490 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11491 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11492 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11492 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11495 Disintegrin 11495 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 11495 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 11496 Disintegrin 11496 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 11496 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 11497 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11497 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11498 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11498 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11499 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11499 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11500 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11500 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11501 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11501 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11502 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11502 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11504 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 11504 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 11512 Adenylate and Guanylate cyclase catalytic domain 11513 Adenylate and Guanylate cyclase catalytic domain 11514 Adenylate and Guanylate cyclase catalytic domain 11515 Adenylate and Guanylate cyclase catalytic domain 11516 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 11517 7 transmembrane receptor (Secretin family) 11518 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 11519 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 11520 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 11534 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es 11535 Adrenomedullin 11536 7 transmembrane receptor (rhodopsin family) 11539 7 transmembrane receptor (rhodopsin family) 11540 7 transmembrane receptor (rhodopsin family) 11541 7 transmembrane receptor (rhodopsin family) 11542 7 transmembrane receptor (rhodopsin family) 11542 7 transmembrane receptor (rhodopsin family) 11544 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi 11545 Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc 11545 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 11545 pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini 11546 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 11546 pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini 11548 7 transmembrane receptor (rhodopsin family) 11548 7 transmembrane receptor (rhodopsin family) 11549 7 transmembrane receptor (rhodopsin family) 11551 7 transmembrane receptor (rhodopsin family) 11552 7 transmembrane receptor (rhodopsin family) 11553 7 transmembrane receptor (rhodopsin family) 11553 7 transmembrane receptor (rhodopsin family) 11554 7 transmembrane receptor (rhodopsin family) 11555 7 transmembrane receptor (rhodopsin family) 11555 7 transmembrane receptor (rhodopsin family) 11556 7 transmembrane receptor (rhodopsin family) 11565 Adenylosuccinate synthetase 11566 Adenylosuccinate synthetase 11567 Villin headpiece domain 11568 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 11593 Asparaginase 11595 Extracellular link domain 11595 Sushi domain (SCR repeat) 11596 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 11596 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 11601 Intermediate filament protein 11603 Laminin G domain 11603 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 11603 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 11603 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 11603 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 11603 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 11605 Melibiase 11606 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 11607 7 transmembrane receptor (rhodopsin family) 11608 7 transmembrane receptor (rhodopsin family) 11609 7 transmembrane receptor (rhodopsin family) 11622 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 11622 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 11625 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 11634 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 11634 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 11634 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 11636 Adenylate kinase 11637 Adenylate kinase 11639 Adenylate kinase 11640 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 11647 Alkaline phosphatase 11648 Alkaline phosphatase 11650 Alkaline phosphatase 11651 PH domain. PH stands for pleckstrin homology 11657 Serum albumin family 11668 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 11669 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 11670 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 11671 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 11674 Fructose-bisphosphate aldolase class-I 11676 Fructose-bisphosphate aldolase class-I 11677 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 11682 MAM domain. An extracellular domain found in many receptors 11684 Lipoxygenase 11685 Lipoxygenase 11686 Lipoxygenase 11686 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 11687 Lipoxygenase 11688 Lipoxygenase 11688 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 11690 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 11694 Homeobox domain 11695 OAR domain 11695 Homeobox domain 11699 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 11702 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermi 11702 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 11703 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermi 11703 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 11704 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m 11705 Transforming growth factor beta like domain 11717 Adenosine/AMP deaminase 11722 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe 11722 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st 11723 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe 11723 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st 11727 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 11730 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 11731 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 11733 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11735 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 11736 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 11739 Mitochondrial carrier protein 11740 Mitochondrial carrier protein 11744 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11745 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11746 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11747 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11749 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11749 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11750 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11752 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 11754 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 11754 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 11754 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 11764 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 11764 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11764 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 11764 Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found 11764 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 11765 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 11765 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11766 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 11766 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11767 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 11768 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 11769 Clathrin adaptor complex small chain 11770 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 11771 Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site 11771 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 11771 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11772 Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site 11772 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 11772 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11773 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 11774 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11775 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11776 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11778 Clathrin adaptor complex small chain 11779 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 11781 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 11783 NB-ARC domain 11783 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 11784 Phosphotyrosine interaction domain (PTB/PID) 11784 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 11785 Phosphotyrosine interaction domain (PTB/PID) 11785 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 11785 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 11787 Phosphotyrosine interaction domain (PTB/PID) 11787 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 11787 Phosphotyrosine interaction domain (PTB/PID) 11787 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 11789 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 11789 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 11793 Autophagy protein Apg5. Apg5p is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway 11796 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 11797 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 11803 Amyloid A4 extracellular domain 11806 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 11808 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 11808 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E 11808 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 11810 Cytidine and deoxycytidylate deaminase zinc-binding region 11816 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 11818 Sushi domain (SCR repeat) 11819 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 11819 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 11820 Amyloid A4 extracellular domain 11820 Beta-amyloid peptide (beta-APP) 11821 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 11826 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 11827 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 11829 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 11829 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 11830 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 11831 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 11833 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 11835 Androgen receptor 11835 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 11835 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 11836 Protein kinase domain 11836 Raf-like Ras-binding domain 11836 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 11837 Ribosomal protein L10 11838 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 11846 Arginase family 11847 Arginase family 11855 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 11855 FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions 11857 RHO protein GDP dissociation inhibitor 11859 Homeobox domain 11861 ADP-ribosylation factor family 11864 Helix-loop-helix DNA-binding domain 11865 Helix-loop-helix DNA-binding domain 11870 NAD:arginine ADP-ribosyltransferase 11871 NAD:arginine ADP-ribosyltransferase 11871 NAD:arginine ADP-ribosyltransferase 11872 NAD:arginine ADP-ribosyltransferase 11875 NAD:arginine ADP-ribosyltransferase 11876 Transforming growth factor beta like domain 11878 Homeobox domain 11881 Sulfatase 11883 Sulfatase 11883 Sulfatase 11886 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 11889 Hepatic lectin, N-terminal domain 11889 Lectin C-type domain. This family includes both long and short form C-type 11889 Hepatic lectin, N-terminal domain 11889 Lectin C-type domain. This family includes both long and short form C-type 11890 Hepatic lectin, N-terminal domain 11890 Lectin C-type domain. This family includes both long and short form C-type 11891 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 11898 Arginosuccinate synthase. This family contains a PP-loop motif 11905 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 11906 Homeobox domain 11908 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 11908 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 11909 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 11911 bZIP transcription factor 11920 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 11923 Helix-loop-helix DNA-binding domain 11924 Helix-loop-helix DNA-binding domain 11925 Helix-loop-helix DNA-binding domain 11927 Heavy-metal-associated domain 11931 Sodium / potassium ATPase beta chain 11932 Sodium / potassium ATPase beta chain 11933 Sodium / potassium ATPase beta chain 11936 ATP1G1/PLM/MAT8 family 11936 ATP1G1/PLM/MAT8 family 11936 ATP1G1/PLM/MAT8 family 11937 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 11945 Sodium / potassium ATPase beta chain 11947 ATP synthase alpha/beta chain, C terminal domain 11947 ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 11947 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 11947 pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 11949 ATP synthase 11951 ATP synthase subunit C 11972 ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae 11973 ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit 11975 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 11977 Heavy-metal-associated domain 11979 Heavy-metal-associated domain 11984 ATP synthase subunit C 11991 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 11992 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 11993 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 11994 Cadherin domain 11997 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 11998 Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5 12000 7 transmembrane receptor (rhodopsin family) 12000 7 transmembrane receptor (rhodopsin family) 12005 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The D 12007 Class I Histocompatibility antigen, domains alpha 1 and 2 12013 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 12014 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 12018 Apoptosis regulator proteins, Bcl-2 family 12020 Homeobox domain 12022 Homeobox domain 12023 Homeobox domain 12028 Apoptosis regulator proteins, Bcl-2 family 12029 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 12032 Extracellular link domain 12032 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 12034 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 12034 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 12034 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 12035 Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci 12036 Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino acid 12038 Carboxylesterase 12039 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 12040 Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site 12040 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 12041 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 12042 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 12043 Apoptosis regulator proteins, Bcl-2 family 12043 Apoptosis regulator proteins, Bcl-2 family 12044 Apoptosis regulator proteins, Bcl-2 family 12045 Apoptosis regulator proteins, Bcl-2 family 12046 Apoptosis regulator proteins, Bcl-2 family 12047 Apoptosis regulator proteins, Bcl-2 family 12048 Apoptosis regulator proteins, Bcl-2 family 12050 Apoptosis regulator proteins, Bcl-2 family 12053 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 12057 7 transmembrane receptor (rhodopsin family) 12061 7 transmembrane receptor (rhodopsin family) 12062 7 transmembrane receptor (rhodopsin family) 12064 Nerve growth factor family 12075 Intermediate filament protein 12091 Glycosyl hydrolases family 35 12096 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 12097 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 12116 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 12116 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 12140 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 12143 SH2 domain 12143 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 12145 7 transmembrane receptor (rhodopsin family) 12151 Zinc finger, C3HC4 type (RING finger) 12153 CUB domain 12154 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12156 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12159 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12160 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12161 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12162 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12163 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12164 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12164 Transforming growth factor beta like domain 12164 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12165 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12165 Transforming growth factor beta like domain 12165 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 12166 Protein kinase domain 12166 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 12167 Protein kinase domain 12167 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 12168 Protein kinase domain 12168 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 12180 MYND finger 12182 ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion 12182 ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion 12189 Zinc finger, C3HC4 type (RING finger) 12189 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 12190 BRCA2 repeat. The alignment covers only the most conserved region of the repeat 12192 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 12209 7 transmembrane receptor (rhodopsin family) 12211 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 12211 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 12215 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 12224 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 12226 BTG1 family. A novel family of anti-proliferative proteins 12227 BTG1 family. A novel family of anti-proliferative proteins 12228 BTG1 family. A novel family of anti-proliferative proteins 12229 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 12235 Protein kinase domain 12235 Protein kinase domain 12236 Protein kinase domain 12236 Protein kinase domain 12257 TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermedia 12261 Mitochondrial glycoprotein. This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q. It is thought to be involved in mitochondrial oxidative phosphorylation and in nucleus-mitochondrion inte 12262 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12263 Sushi domain (SCR repeat) 12263 von Willebrand factor type A domain 12263 Trypsin 12263 Sushi domain (SCR repeat) 12263 von Willebrand factor type A domain 12263 Trypsin 12263 Sushi domain (SCR repeat) 12263 von Willebrand factor type A domain 12266 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 12266 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 12266 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 12266 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 12267 7 transmembrane receptor (rhodopsin family) 12268 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 12268 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 12268 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 12268 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 12269 Sushi domain (SCR repeat) 12273 7 transmembrane receptor (rhodopsin family) 12274 Sushi domain (SCR repeat) 12274 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 12279 Low-density lipoprotein receptor domain class A 12279 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 12282 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 12283 Mo25 protein family 12286 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12286 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12287 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12288 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12289 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12290 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12290 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12291 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12292 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12293 Cache domain 12294 Cache domain 12295 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 12295 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 12295 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 12296 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 12296 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 12296 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 12297 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 12298 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 12300 PMP-22/EMP/MP20/Claudin family 12304 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 12305 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 12305 Protein kinase domain 12305 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 12306 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 12306 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 12309 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 12309 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ 12310 Calcitonin / CGRP / IAPP family 12311 7 transmembrane receptor (Secretin family) 12311 7 transmembrane receptor (Secretin family) 12311 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 12317 Calreticulin family 12319 Eukaryotic-type carbonic anhydrase 12325 Protein kinase domain 12326 Protein kinase domain 12330 Calreticulin family 12331 CAP protein 12333 Calpain family cysteine protease 12333 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 12334 Calpain family cysteine protease 12334 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 12334 Calpain family cysteine protease 12334 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by 12335 Calpain family cysteine protease 12335 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 12337 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 12337 Calpain family cysteine protease 12337 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by 12338 C2 domain 12338 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 12339 MIT domain 12340 F-actin capping protein alpha subunit 12340 F-actin capping protein alpha subunit 12343 F-actin capping protein alpha subunit 12344 F-actin capping protein alpha subunit 12345 F-actin capping protein, beta subunit 12346 Eukaryotic-type carbonic anhydrase 12348 Eukaryotic-type carbonic anhydrase 12349 Eukaryotic-type carbonic anhydrase 12350 Eukaryotic-type carbonic anhydrase 12351 Eukaryotic-type carbonic anhydrase 12352 Eukaryotic-type carbonic anhydrase 12353 Eukaryotic-type carbonic anhydrase 12354 Eukaryotic-type carbonic anhydrase 12354 Eukaryotic-type carbonic anhydrase 12355 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 12355 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 12359 Catalase 12361 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 12361 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 12361 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 12361 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 12362 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 12363 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 12366 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 12370 Death effector domain 12370 ICE-like protease (caspase) p10 domain 12370 ICE-like protease (caspase) p20 domain 12371 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 12372 Calsequestrin 12373 Calsequestrin 12374 7 transmembrane receptor (metabotropic glutamate family) 12374 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 12378 7 transmembrane receptor (metabotropic glutamate family) 12378 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 12379 7 transmembrane receptor (metabotropic glutamate family) 12379 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 12380 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 12380 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 12385 Vinculin family 12386 Vinculin family 12386 Vinculin family 12387 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 12388 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 12389 Caveolin 12390 Caveolin 12391 Caveolin 12393 Runt domain 12394 Runt domain 12395 MYND finger 12398 MYND finger 12399 Runt domain 12400 Core binding factor beta subunit. Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances DNA-binding ability of the alpha subunit in vitro, and has b 12401 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 12402 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 12402 CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co 12402 CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser 12406 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 12406 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 12408 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 12409 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 12412 'chromo' (CHRromatin Organization MOdifier) domain 12412 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 12416 'chromo' (CHRromatin Organization MOdifier) domain 12417 'chromo' (CHRromatin Organization MOdifier) domain 12417 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 12418 'chromo' (CHRromatin Organization MOdifier) domain 12419 'chromo' (CHRromatin Organization MOdifier) domain 12419 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 12424 Gastrin/cholecystokinin family 12425 7 transmembrane receptor (rhodopsin family) 12426 7 transmembrane receptor (rhodopsin family) 12428 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12443 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12444 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12445 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12445 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 12445 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 12447 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi 12447 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12448 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12449 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12450 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12452 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12453 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 12454 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 12457 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 12457 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 12458 7 transmembrane receptor (rhodopsin family) 12460 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 12462 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 12464 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 12465 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 12466 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 12466 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 12467 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 12469 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 12476 Tetraspanin family 12482 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 12483 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 12491 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 12492 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 12493 Tetraspanin family 12494 ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion 12495 GDA1/CD39 (nucleoside phosphatase) family 12496 GDA1/CD39 (nucleoside phosphatase) family 12497 GDA1/CD39 (nucleoside phosphatase) family 12499 GDA1/CD39 (nucleoside phosphatase) family 12505 Extracellular link domain 12505 Extracellular link domain 12507 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 12508 Tetraspanin family 12509 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 12511 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 12512 Tetraspanin family 12514 Lysosome-associated membrane glycoprotein (Lamp) 12515 Lectin C-type domain. This family includes both long and short form C-type 12520 Tetraspanin family 12521 Tetraspanin family 12527 Tetraspanin family 12539 Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90 12545 Protein kinase domain 12550 Cadherin domain 12550 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 12551 Cadherin domain 12551 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 12552 Cadherin domain 12552 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 12554 Cadherin domain 12555 Cadherin domain 12555 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 12556 Cadherin domain 12557 Cadherin domain 12558 Cadherin domain 12558 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 12560 Cadherin domain 12560 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 12561 Cadherin domain 12561 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 12562 Cadherin domain 12562 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 12563 Cadherin domain 12563 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 12564 Cadherin domain 12564 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 12565 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 12569 Cyclin-dependent kinase 5 activator protein 12570 Cyclin-dependent kinase 5 activator protein 12572 Protein kinase domain 12575 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 12576 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 12577 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 12577 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 12577 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 12590 Homeobox domain 12591 Homeobox domain 12592 Homeobox domain 12607 CBF/Mak21 family 12613 Carboxylesterase 12614 Cadherin domain 12614 7 transmembrane receptor (Secretin family) 12614 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 12614 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 12616 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 12622 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 12623 Carboxylesterase 12628 Sushi domain (SCR repeat) 12630 Low-density lipoprotein receptor domain class A 12631 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 12633 Death effector domain 12640 Glycoprotein hormone 12648 'chromo' (CHRromatin Organization MOdifier) domain 12648 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 12651 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 12652 Granin (chromogranin or secretogranin) 12653 Granin (chromogranin or secretogranin) 12654 Glycosyl hydrolases family 18 12660 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 12661 Fibronectin type III domain 12662 GDP dissociation inhibitor 12663 GDP dissociation inhibitor 12667 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 12669 7 transmembrane receptor (rhodopsin family) 12671 7 transmembrane receptor (rhodopsin family) 12672 7 transmembrane receptor (rhodopsin family) 12677 Homeobox domain 12683 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 12684 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 12686 GNS1/SUR4 family 12704 Tropomyosin 12704 Protein kinase domain 12704 Uncharacterized ACR, COG1579 12704 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 12704 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 12704 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 12704 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 12709 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 12709 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 12715 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 12715 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 12716 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 12716 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 12723 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 12723 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 12724 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 12725 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 12727 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 12728 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 12729 Nucleotide-sensitive chloride conductance regulator (ICln) 12733 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 12737 PMP-22/EMP/MP20/Claudin family 12738 PMP-22/EMP/MP20/Claudin family 12739 PMP-22/EMP/MP20/Claudin family 12740 PMP-22/EMP/MP20/Claudin family 12741 PMP-22/EMP/MP20/Claudin family 12741 PMP-22/EMP/MP20/Claudin family 12745 Calreticulin family 12745 Calreticulin family 12752 CLN3 protein. This is a family of proteins from the CLN3 gene. A missense mutation of glutamic acid (E) to lysine (K) at position 295 in the human protein has been implicated in Juvenile neuronal ceroid lipofuscinosis (Batten disease) 12753 Helix-loop-helix DNA-binding domain 12753 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 12753 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 12757 Clathrin light chain 12759 Clusterin 12764 Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <=> diphosphate + CMP-3-deoxy-D-manno-oc 12765 7 transmembrane receptor (rhodopsin family) 12766 7 transmembrane receptor (rhodopsin family) 12767 7 transmembrane receptor (rhodopsin family) 12768 7 transmembrane receptor (rhodopsin family) 12769 7 transmembrane receptor (rhodopsin family) 12770 7 transmembrane receptor (rhodopsin family) 12771 7 transmembrane receptor (rhodopsin family) 12772 7 transmembrane receptor (rhodopsin family) 12773 7 transmembrane receptor (rhodopsin family) 12774 7 transmembrane receptor (rhodopsin family) 12775 7 transmembrane receptor (rhodopsin family) 12776 7 transmembrane receptor (rhodopsin family) 12777 7 transmembrane receptor (rhodopsin family) 12778 7 transmembrane receptor (rhodopsin family) 12788 Cyclic nucleotide-binding domain 12788 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12789 Cyclic nucleotide-binding domain 12789 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12790 Cyclic nucleotide-binding domain 12790 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 12793 Cornichon protein 12794 Cornichon protein 12795 POLO box duplicated region 12796 Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins 12797 Calponin family repeat 12798 Calponin family repeat 12801 7 transmembrane receptor (rhodopsin family) 12802 7 transmembrane receptor (rhodopsin family) 12803 Ciliary neurotrophic factor 12803 Ciliary neurotrophic factor 12805 Fibronectin type III domain 12810 LCCL domain 12810 von Willebrand factor type A domain 12813 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 12814 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 12814 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12815 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 12816 Fibronectin type III domain 12816 von Willebrand factor type A domain 12816 Thrombospondin N-terminal -like domain 12817 Collagen triple helix repeat (20 copies) 12818 Fibronectin type III domain 12818 von Willebrand factor type A domain 12818 von Willebrand factor type A domain 12818 Thrombospondin N-terminal -like domain 12818 Thrombospondin N-terminal -like domain 12819 Thrombospondin N-terminal -like domain 12819 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12821 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12822 Thrombospondin N-terminal -like domain 12822 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12823 Thrombospondin N-terminal -like domain 12823 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12824 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 12825 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 12826 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 12827 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 12827 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12828 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 12829 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 12830 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 12830 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 12832 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 12832 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 12832 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 12832 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 12833 von Willebrand factor type A domain 12833 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12834 von Willebrand factor type A domain 12834 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12834 von Willebrand factor type A domain 12835 von Willebrand factor type A domain 12835 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12835 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 12836 Fibronectin type III domain 12836 von Willebrand factor type A domain 12836 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12837 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 12839 Thrombospondin N-terminal -like domain 12839 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 12847 Coatomer WD associated domain 12847 WD domain, G-beta repeat 12847 Coatomer WD associated domain 12854 Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som 12857 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 12858 Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va 12859 Cytochrome c oxidase subunit Vb 12861 Cytochrome c oxidase subunit VIa 12862 Cytochrome c oxidase subunit VIa 12864 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 12865 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 12866 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 12867 pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast me 12867 Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast member of this family 12867 pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem 12867 Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast member of this family 12867 pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem 12868 Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII 12869 Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII 12870 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 12873 Zinc carboxypeptidase 12873 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 12876 Zinc carboxypeptidase 12891 C2 domain 12892 Coproporphyrinogen III oxidase 12894 Choline/Carnitine o-acyltransferase 12895 Choline/Carnitine o-acyltransferase 12896 Choline/Carnitine o-acyltransferase 12902 Sushi domain (SCR repeat) 12903 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 12904 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 12908 Choline/Carnitine o-acyltransferase 12912 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 12912 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 12912 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 12912 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 12916 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 12916 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 12921 7 transmembrane receptor (Secretin family) 12922 7 transmembrane receptor (Secretin family) 12925 LIM domain. This family represents two copies of the LIM structural domain 12928 SH2 domain 12928 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 12929 SH2 domain 12929 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 12933 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 12933 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 12934 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 12944 Pentaxin family. Pentaxins are also known as pentraxins 12945 CUB domain 12945 Zona pellucida-like domain 12945 CUB domain 12945 Zona pellucida-like domain 12945 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 12946 Sushi domain (SCR repeat) 12950 Extracellular link domain 12951 Homeobox domain 12952 FAD binding domain of DNA photolyase 12953 FAD binding domain of DNA photolyase 12954 Hsp20/alpha crystallin family 12954 Alpha crystallin A chain, N terminal 12954 Hsp20/alpha crystallin family 12954 Alpha crystallin A chain, N terminal 12955 Hsp20/alpha crystallin family 12955 Alpha crystallin A chain, N terminal 12957 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12958 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12959 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12960 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12961 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12962 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12964 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12966 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12967 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12968 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12969 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12970 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 12971 Ornithine cyclodeaminase/mu-crystallin family. This family contains the bacterial Ornithine cyclodeaminase enzyme EC:4.3.1.12, which catalyses the deamination of ornithine to proline. This family also contains mu-Crystallin the major component of the eye 12972 Zinc-binding dehydrogenase 12974 Citrate synthase 12981 Granulocyte-macrophage colony-stimulating factor 12985 Interleukin-6/G-CSF/MGF family 12986 Fibronectin type III domain 12990 Casein 12991 Casein 12992 Casein 12994 Kappa casein. Kappa-casein is a mammalian milk protein involved in a number of important physiological processes. In the gut, the ingested protein is split into an insoluble peptide (para kappa-casein) and a soluble hydrophilic glycopeptide (caseinomacro 12995 Protein kinase domain 12995 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 12999 Protein kinase domain 13000 Protein kinase domain 13001 Casein kinase II regulatory subunit 13003 Extracellular link domain 13003 Sushi domain (SCR repeat) 13003 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 13004 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 13007 LIM domain. This family represents two copies of the LIM structural domain 13008 LIM domain. This family represents two copies of the LIM structural domain 13009 LIM domain. This family represents two copies of the LIM structural domain 13012 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 13012 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 13014 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 13030 Papain family cysteine protease 13032 Papain family cysteine protease 13033 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 13034 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 13035 Trypsin 13036 Papain family cysteine protease 13041 Papain family cysteine protease 13043 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 13047 Homeobox domain 13047 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 13048 Homeobox domain 13048 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 13051 7 transmembrane receptor (rhodopsin family) 13056 Cytochrome b561 13058 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 13070 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13072 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13074 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13075 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13076 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13077 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13078 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13079 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13081 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13082 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13085 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13086 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13087 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13088 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13089 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13090 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13094 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13095 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13096 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13096 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13096 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 13097 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13098 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13099 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13099 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13101 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13105 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13106 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13107 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13108 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13109 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13110 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13112 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13113 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13114 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13115 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13116 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13117 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13118 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13119 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13120 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13121 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13122 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13123 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13124 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 13132 Phosphotyrosine interaction domain (PTB/PID) 13134 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 13135 DAD family. Members of this family are thought to be integral membrane proteins. Some members of this family have been shown to cause apoptosis if mutated, these proteins are known as DAD for defender against death. The family also includes the epsilon s 13136 Sushi domain (SCR repeat) 13137 Sushi domain (SCR repeat) 13139 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 13139 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 13142 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 13143 Protein kinase domain 13162 Sodium:neurotransmitter symporter family 13163 Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined 13166 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 13166 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 13167 Acyl CoA binding protein 13168 Acyl CoA binding protein 13172 Homeobox domain 13176 Fibronectin type III domain 13177 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 13178 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 13180 Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha) 13184 Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein 13185 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp 13190 Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372 13191 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 13194 CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p 13195 Pyridoxal-dependent decarboxylase conserved domain 13196 PH domain. PH stands for pleckstrin homology 13196 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 13196 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 13196 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 13196 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 13197 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 13200 Dolichyl-diphosphooligosaccharide-protein glycosyltransferase 48kD subunit. Members of this family are involved in asparagine-linked protein glycosylation. In particular, dolichyl-diphosphooligosaccharide-protein glycosyltransferase (DDOST), also known a 13202 Macrophage migration inhibitory factor (MIF) 13206 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 13206 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 13209 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 13209 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 13215 Beta defensin 13216 Mammalian defensin 13216 Defensin propeptide 13218 Defensin propeptide 13221 Defensin propeptide 13222 Defensin propeptide 13237 Mammalian defensin 13237 Defensin propeptide 13238 Defensin propeptide 13238 Defensin propeptide 13239 Mammalian defensin 13239 Defensin propeptide 13240 Mammalian defensin 13240 Defensin propeptide 13244 Fatty acid desaturase 13340 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 13345 Helix-loop-helix DNA-binding domain 13346 Intermediate filament protein 13346 Intermediate filament protein 13347 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 13358 Mitochondrial carrier protein 13358 Mitochondrial carrier protein 13358 Mitochondrial carrier protein 13360 Ergosterol biosynthesis ERG4/ERG24 family 13361 Dihydrofolate reductase 13363 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 13363 Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation 13368 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 13370 Iodothyronine deiodinase 13371 Iodothyronine deiodinase 13384 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 13388 Delta serrate ligand 13390 Homeobox domain 13392 Homeobox domain 13393 Homeobox domain 13394 Homeobox domain 13401 WD domain, G-beta repeat 13405 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 13405 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 13411 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 13417 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 13421 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 13423 Deoxyribonuclease II 13424 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 13424 Actin interacting protein 3 13424 Uncharacterized ACR, COG1579 13424 Type IV secretion system CagX conjugation protein 13424 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 13424 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 13424 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 13429 Dynamin family 13429 Dynamin GTPase effector domain 13429 PH domain. PH stands for pleckstrin homology 13429 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 13430 Dynamin GTPase effector domain 13430 PH domain. PH stands for pleckstrin homology 13430 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 13433 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 13433 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 13434 C-5 cytosine-specific DNA methylase 13435 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 13435 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 13436 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 13437 Aminopeptidase I zinc metalloprotease (M18) 13446 C2 domain 13447 C2 domain 13449 PTB domain (IRS-1 type) 13476 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 13476 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 13478 Glycosyl transferase 13479 Renal dipeptidase 13480 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 13482 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 13483 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 13487 Sulfate transporter family. Mutations may lead to several human diseases 13487 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 13488 7 transmembrane receptor (rhodopsin family) 13489 7 transmembrane receptor (rhodopsin family) 13490 7 transmembrane receptor (rhodopsin family) 13491 7 transmembrane receptor (rhodopsin family) 13492 7 transmembrane receptor (rhodopsin family) 13492 7 transmembrane receptor (rhodopsin family) 13496 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 13497 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 13497 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 13497 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 13498 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 13505 Cadherin domain 13506 Cadherin domain 13507 Cadherin domain 13507 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 13508 Fibronectin type III domain 13511 Cadherin domain 13512 Cadherin domain 13512 Cadherin domain 13512 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 13516 Leucine rich repeat N-terminal domain 13518 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 13518 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 13521 Sulfate transporter family. Mutations may lead to several human diseases 13522 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 13522 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 13524 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 13524 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 13525 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 13525 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 13526 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 13526 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 13529 Somatotropin hormone family 13537 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 13542 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 13542 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 13542 Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778) 13542 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 13543 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 13543 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 13543 Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778) 13543 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 13544 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 13544 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 13544 Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778) 13544 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 13548 Protein kinase domain 13548 Protein kinase domain 13555 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 13557 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 13559 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 13586 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 13590 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 13591 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 13592 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 13593 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 13602 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 13603 7 transmembrane receptor (rhodopsin family) 13605 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 13608 Death domain 13609 7 transmembrane receptor (rhodopsin family) 13610 7 transmembrane receptor (rhodopsin family) 13611 7 transmembrane receptor (rhodopsin family) 13612 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 13612 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 13612 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 13614 Endothelin family 13615 Endothelin family 13616 Endothelin family 13617 7 transmembrane receptor (rhodopsin family) 13618 7 transmembrane receptor (rhodopsin family) 13619 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 13627 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 13629 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 13631 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 13636 Ephrin 13637 Ephrin 13638 Ephrin 13639 Ephrin 13640 Ephrin 13641 Ephrin 13642 Ephrin 13643 Ephrin 13649 Furin-like cysteine rich region 13649 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 13650 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 13650 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. These proteins contain three strongly conserved histidines in the putative trans 13650 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 13650 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. These proteins contain three strongly conserved histidines in the putative trans 13650 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 13664 Eukaryotic initiation factor 1A 13667 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 13669 V-type ATPase 116kDa subunit family 13684 Eukaryotic initiation factor 4E 13690 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 13706 Trypsin 13712 Ets-domain 13712 Ets-domain 13713 Ets-domain 13714 Ets-domain 13716 Occludin/ELL family 13722 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte act 13726 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 13726 Translation initiation factor SUI1 13726 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 13728 Kinase associated domain 1 13730 PMP-22/EMP/MP20/Claudin family 13731 PMP-22/EMP/MP20/Claudin family 13732 PMP-22/EMP/MP20/Claudin family 13733 7 transmembrane receptor (Secretin family) 13733 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 13796 Homeobox domain 13797 Homeobox domain 13798 Homeobox domain 13799 Homeobox domain 13800 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 13803 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 13809 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 13813 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 13821 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 13822 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 13822 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 13823 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 13824 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 13828 Transglutaminase family 13828 Transglutaminase family, C-terminal ig like domain 13828 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 13830 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 13835 Fibronectin type III domain 13835 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13836 Fibronectin type III domain 13836 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13837 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13837 Protein kinase domain 13837 Fibronectin type III domain 13837 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13837 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 13838 Fibronectin type III domain 13838 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13839 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13839 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 13839 Protein kinase domain 13839 Fibronectin type III domain 13839 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13839 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 13840 Fibronectin type III domain 13840 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13841 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13841 Protein kinase domain 13841 Fibronectin type III domain 13841 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13841 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 13842 Fibronectin type III domain 13842 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13845 Protein kinase domain 13845 Fibronectin type III domain 13845 Giardia variant-specific surface protein 13845 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13845 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 13846 Fibronectin type III domain 13846 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13846 Protein kinase domain 13846 Fibronectin type III domain 13846 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13846 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 13848 Fibronectin type III domain 13848 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 13849 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 13850 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 13852 Syntaxin 13853 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 13855 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 13856 Erythropoietin/thrombopoietin 13856 Erythropoietin/thrombopoietin 13856 Erythropoietin/thrombopoietin 13861 Animal haem peroxidase 13863 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 13864 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 13864 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 13865 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 13865 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 13866 Furin-like cysteine rich region 13866 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 13866 Protein kinase domain 13866 Furin-like cysteine rich region 13866 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 13867 Protein kinase domain 13867 Furin-like cysteine rich region 13869 Furin-like cysteine rich region 13869 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 13875 Ets-domain 13876 Ets-domain 13876 Sterile alpha motif (SAM)/Pointed domain 13876 Ets-domain 13876 Sterile alpha motif (SAM)/Pointed domain 13877 Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene 13884 Carboxylesterase 13897 Carboxylesterase 13909 Carboxylesterase 13982 Oestrogen receptor 13982 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 13982 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 13983 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 13983 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 13984 Homeobox domain 13990 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 13990 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 13990 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 13998 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 14008 Ets-domain 14009 Ets-domain 14011 Ets-domain 14011 Sterile alpha motif (SAM)/Pointed domain 14013 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 14025 Zinc finger, C2H2 type 14026 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 14027 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 14028 Homeobox domain 14029 Homeobox domain 14042 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 14043 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 14048 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 14049 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 14050 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 14051 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t 14055 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 14056 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 14057 Tricarboxylate carrier 14060 Sushi domain (SCR repeat) 14061 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 14062 7 transmembrane receptor (rhodopsin family) 14063 7 transmembrane receptor (rhodopsin family) 14064 7 transmembrane receptor (rhodopsin family) 14065 7 transmembrane receptor (rhodopsin family) 14066 Tissue factor 14067 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 14067 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 14069 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 14069 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 14071 Trypsin 14071 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 14071 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 14071 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 14073 Amidase 14077 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 14079 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 14080 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 14082 Death effector domain 14083 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 14085 Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the ty 14087 Fanconi anaemia group A protein 14088 Fanconi anaemia group C protein 14089 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 14102 TNFR/NGFR cysteine-rich region 14103 TNF(Tumor Necrosis Factor) family 14104 Zinc-binding dehydrogenase 14104 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 14104 Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid th 14104 Acyl transferase domain 14104 Zinc-binding dehydrogenase 14104 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 14104 Beta-ketoacyl synthase, C-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains 14104 Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains. 14104 Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix bu 14104 Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thi 14107 Cadherin domain 14107 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 14113 Fibrillarin 14118 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 14118 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 14119 Cadherin domain 14119 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 14120 Fructose-1-6-bisphosphatase 14121 Fructose-1-6-bisphosphatase 14126 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 14132 Class I Histocompatibility antigen, domains alpha 1 and 2 14133 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 14134 Fibrinogen beta and gamma chains, C-terminal globular domain 14134 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 14137 Squalene/phytoene synthase 14137 Squalene/phytoene synthase 14151 Ferrochelatase 14154 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 14154 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 14156 XPG I-region 14156 XPG N-terminal domain 14156 5'-3' exonuclease, C-terminal SAM fold 14156 XPG I-region 14156 XPG N-terminal domain 14156 5'-3' exonuclease, C-terminal SAM fold 14158 SH2 domain 14159 SH2 domain 14159 Protein kinase domain 14160 7 transmembrane receptor (rhodopsin family) 14163 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 14163 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 14164 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14167 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling. 14171 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14172 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14173 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14174 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14175 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14176 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14177 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling. 14178 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14179 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14180 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 14186 Protein kinase domain 14186 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14187 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 14190 Fibrinogen beta and gamma chains, C-terminal globular domain 14190 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 14191 SH2 domain 14191 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 14194 Lyase 14198 HIT family 14200 LIM domain. This family represents two copies of the LIM structural domain 14201 LIM domain. This family represents two copies of the LIM structural domain 14204 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 14208 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 14219 Thrombospondin type 1 domain 14227 FKBP-type peptidyl-prolyl cis-trans isomerase 14228 FKBP-type peptidyl-prolyl cis-trans isomerase 14229 FKBP-type peptidyl-prolyl cis-trans isomerase 14230 FKBP-type peptidyl-prolyl cis-trans isomerase 14231 FKBP-type peptidyl-prolyl cis-trans isomerase 14232 FKBP-type peptidyl-prolyl cis-trans isomerase 14234 Fork head domain 14236 Fork head domain 14236 Fork head domain 14237 Fork head domain 14239 Fork head domain 14240 Fork head domain 14247 Ets-domain 14251 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 14252 Flotillin family. Flotillins are integral membrane proteins that have been shown to be present in several subcellular components, including caveolae (invaginated plasma membrane microdomains), lipid rafts (sphingolipid and cholesterol-rich, detergent-res 14255 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 14255 Protein kinase domain 14255 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14256 pfam02947, flt3_lig, flt3 ligand. The flt3 ligand is a short chain cytokine with a 4 helical bundle fold 14261 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 14262 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 14263 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 14265 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 14265 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 14265 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 14268 Fibronectin type I domain 14268 Fibronectin type II domain 14268 Fibronectin type III domain 14269 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 14270 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 14270 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 14270 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 14275 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 14276 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 14287 Mur ligase family, catalytic domain. This family contains a number of related ligase enzymes which have EC numbers 6.3.2.*. This family includes: MurC, MurD, MurE, MurF, Mpl and FolC. MurC, MurD, Mure and MurF catalyse consecutive steps in the synthesis o 14288 7 transmembrane receptor (rhodopsin family) 14289 7 transmembrane receptor (rhodopsin family) 14290 7 transmembrane receptor (rhodopsin family) 14291 7 transmembrane receptor (rhodopsin family) 14293 7 transmembrane receptor (rhodopsin family) 14294 7 transmembrane receptor (rhodopsin family) 14297 Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain 14309 7 transmembrane receptor (rhodopsin family) 14311 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 14311 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 14319 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 14325 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 14337 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 14343 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 14344 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 14345 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 14347 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 14348 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 14352 ENV polyprotein (coat polyprotein) 14356 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 14357 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 14359 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 14360 SH2 domain 14360 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 14362 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 14365 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 14366 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 14367 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 14367 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 14367 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 14368 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 14369 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 14370 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 14371 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 14371 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 14376 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 14387 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 14390 Ets-domain 14394 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14394 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14395 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14395 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14396 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14396 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14397 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14397 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14397 Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14397 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14397 Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14397 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14399 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14399 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14400 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14400 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14401 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14401 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14402 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14402 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14403 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14403 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14404 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14404 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14405 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14405 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14406 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14406 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14406 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14406 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14407 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14407 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14408 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14408 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14409 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14409 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14411 Sodium:neurotransmitter symporter family 14412 Sodium:neurotransmitter symporter family 14415 Pyridoxal-dependent decarboxylase conserved domain 14417 Pyridoxal-dependent decarboxylase conserved domain 14420 Glycosyl hydrolase family 59 14421 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 14422 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 14423 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 14425 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 14426 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 14427 7 transmembrane receptor (rhodopsin family) 14428 7 transmembrane receptor (rhodopsin family) 14429 7 transmembrane receptor (rhodopsin family) 14432 IQ calmodulin-binding motif. Calmodulin-binding motif 14433 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 14433 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 14457 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 14457 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 14457 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 14459 Gastrin/cholecystokinin family 14461 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d 14465 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do 14466 O-Glycosyl hydrolase family 30 14467 ssDNA binding protein 14468 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 14468 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 14469 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 14469 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 14470 Prenylated rab acceptor (PRA1) 14470 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 14472 Homeobox domain 14473 Serum albumin family 14526 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 14526 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 14527 7 transmembrane receptor (Secretin family) 14531 GCM motif protein 14536 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 14536 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 14536 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 14536 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 14537 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 14537 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 14538 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 14538 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 14539 7 transmembrane receptor (rhodopsin family) 14555 NAD-dependent glycerol-3-phosphate dehydrogenase 14562 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 14563 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 14567 GDP dissociation inhibitor 14568 GDP dissociation inhibitor 14569 GDP dissociation inhibitor 14570 RHO protein GDP dissociation inhibitor 14573 Transforming growth factor beta like domain 14580 Intermediate filament protein 14583 Glutamine amidotransferases class-II 14583 SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Presumably the SIS 14585 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 14585 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron 14586 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 14587 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 14588 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 14593 Polyprenyl synthetase 14594 Glycosyltransferase family 6 14595 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 14598 Gamma-glutamyltranspeptidase 14599 Somatotropin hormone family 14600 Fibronectin type III domain 14601 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 14602 7 transmembrane receptor (Secretin family) 14602 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 14603 Eukaryotic cobalamin-binding protein 14605 TSC-22/dip/bun family 14607 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 14608 7 transmembrane receptor (rhodopsin family) 14609 Connexin 14609 Gap junction alpha-1 protein (Cx43) 14610 Connexin 14611 Connexin 14612 Connexin 14613 Connexin 14615 Connexin 14616 Connexin 14616 Gap junction alpha-8 protein (Cx50) 14617 Connexin 14618 Connexin 14619 Connexin 14620 Connexin 14621 Connexin 14622 Connexin 14623 Connexin 14625 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 14625 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 14629 Glutamate-cysteine ligase. This family represents the catalytic subunit of glutamate-cysteine ligase (E.C. 6.3.2.2), also known as gamma-glutamylcysteine synthetase (GCS). This enzyme catalyses the rate limiting step in the biosynthesis of glutathione. T 14629 Glutamate-cysteine ligase. This family represents the catalytic subunit of glutamate-cysteine ligase (E.C. 6.3.2.2), also known as gamma-glutamylcysteine synthetase (GCS). This enzyme catalyses the rate limiting step in the biosynthesis of glutathione. Th 14645 Glutamine synthetase, catalytic domain 14651 Metallo-beta-lactamase superfamily 14652 7 transmembrane receptor (Secretin family) 14654 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14654 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14657 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14657 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14658 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 14658 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 14664 Sodium:neurotransmitter symporter family 14670 GTPase of unknown function 14672 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14673 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14674 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14675 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14676 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14678 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14679 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14679 ADP-ribosylation factor family 14679 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14680 ADP-ribosylation factor family 14680 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14681 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14682 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14683 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14683 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14685 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14686 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14687 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 14693 WD domain, G-beta repeat 14693 WD domain, G-beta repeat 14699 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 14715 7 transmembrane receptor (rhodopsin family) 14724 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 14725 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 14729 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 14731 Gaa1-like, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to newly 14733 Glypican 14734 Glypican 14735 Glypican 14738 7 transmembrane receptor (rhodopsin family) 14739 7 transmembrane receptor (rhodopsin family) 14744 7 transmembrane receptor (rhodopsin family) 14745 7 transmembrane receptor (rhodopsin family) 14745 7 transmembrane receptor (rhodopsin family) 14747 7 transmembrane receptor (rhodopsin family) 14747 7 transmembrane receptor (rhodopsin family) 14748 7 transmembrane receptor (rhodopsin family) 14751 Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate 14758 Myelin proteolipid protein (PLP or lipophilin) 14760 7 transmembrane receptor (rhodopsin family) 14761 7 transmembrane receptor (rhodopsin family) 14762 7 transmembrane receptor (rhodopsin family) 14762 7 transmembrane receptor (rhodopsin family) 14763 7 transmembrane receptor (rhodopsin family) 14763 7 transmembrane receptor (rhodopsin family) 14763 7TM chemoreceptor. This large family of proteins are related to pfam00001. They are 7 transmembrane receptors. This family does not include all known members, as there are problems with overlapping specificity with pfam00001. This family is greatly expand 14763 7 transmembrane receptor (rhodopsin family) 14763 7TM chemoreceptor. This large family of proteins are related to pfam00001. They are 7 transmembrane receptors. This family does not include all known members, as there are problems with overlapping specificity with pfam00001. This family is greatly expand 14764 7 transmembrane receptor (rhodopsin family) 14765 7 transmembrane receptor (rhodopsin family) 14766 7 transmembrane receptor (Secretin family) 14767 7 transmembrane receptor (rhodopsin family) 14772 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 14773 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 14775 Glutathione peroxidase 14776 Glutathione peroxidase 14778 Glutathione peroxidase 14780 Glutathione peroxidase 14784 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 14784 SH2 domain 14784 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 14786 SH2 domain 14786 PH domain. PH stands for pleckstrin homology 14787 Hr1 repeat 14787 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 14788 7 transmembrane receptor (rhodopsin family) 14791 Suppressor Mra1. The suppressor Mra1 is found in high-copy-number when Ras1 is mutated, that recovers the mating deficiency caused by the decrease of Ras1 activity. Mutational analysis in yeast suggests that the suppressor Mra1 is essential for cell grow 14792 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 14797 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 14799 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14800 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14803 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14804 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14805 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14806 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14807 Bacterial extracellular solute-binding proteins, family 3 14807 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 14809 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14809 Bacterial extracellular solute-binding proteins, family 3 14809 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 14809 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14810 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14811 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14812 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14813 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14814 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14815 Glucocorticoid receptor 14815 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 14815 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 14816 7 transmembrane receptor (metabotropic glutamate family) 14816 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14823 7 transmembrane receptor (metabotropic glutamate family) 14823 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14825 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 14827 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 14828 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 14829 7 transmembrane receptor (rhodopsin family) 14836 Homeobox domain 14842 Homeobox domain 14843 Homeobox domain 14854 Eukaryotic glutathione synthase 14854 Eukaryotic glutathione synthase, ATP binding domain 14857 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14858 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14859 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14860 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14862 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14863 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14864 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14865 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14866 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14866 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 14866 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 14866 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 14866 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 14867 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14869 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14870 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 14872 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 14872 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 14872 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 14872 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 14884 BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function 14885 Transcription factor Tfb2 14886 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 14911 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 14912 Homeobox domain 14915 Guanylin precursor 14916 Guanylin precursor 14917 Protein kinase domain 14917 Adenylate and Guanylate cyclase catalytic domain 14919 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 14924 Guanylate kinase 14924 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 14924 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 14933 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 14933 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 14934 Glycophorin A 14936 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 14937 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 14938 Trypsin 14940 Trypsin 14942 Trypsin 14950 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 14957 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 14958 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 14958 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 14960 Class II histocompatibility antigen, alpha domain 14961 Class II histocompatibility antigen, beta domain 14962 Sushi domain (SCR repeat) 14962 von Willebrand factor type A domain 14962 Trypsin 14962 Sushi domain (SCR repeat) 14962 von Willebrand factor type A domain 14962 Trypsin 14962 Sushi domain (SCR repeat) 14962 von Willebrand factor type A domain 14963 Class I Histocompatibility antigen, domains alpha 1 and 2 14964 Class I Histocompatibility antigen, domains alpha 1 and 2 14964 Class I Histocompatibility antigen, domains alpha 1 and 2 14964 Class I Histocompatibility antigen, domains alpha 1 and 2 14967 Class I Histocompatibility antigen, domains alpha 1 and 2 14968 Class II histocompatibility antigen, alpha domain 14968 Class II histocompatibility antigen, alpha domain 14968 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14968 Class II histocompatibility antigen, alpha domain 14968 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14969 Class II histocompatibility antigen, beta domain 14969 Class II histocompatibility antigen, beta domain 14969 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14969 Class II histocompatibility antigen, beta domain 14969 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14972 Class I Histocompatibility antigen, domains alpha 1 and 2 14972 Class I Histocompatibility antigen, domains alpha 1 and 2 14976 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 14977 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 14979 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 14985 Class I Histocompatibility antigen, domains alpha 1 and 2 14985 Class I Histocompatibility antigen, domains alpha 1 and 2 14985 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14991 Class I Histocompatibility antigen, domains alpha 1 and 2 14997 Class I Histocompatibility antigen, domains alpha 1 and 2 14997 Class I Histocompatibility antigen, domains alpha 1 and 2 14997 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14997 Class I Histocompatibility antigen, domains alpha 1 and 2 14997 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 14998 Class II histocompatibility antigen, alpha domain 14999 Class II histocompatibility antigen, beta domain 14999 Class II histocompatibility antigen, beta domain 15000 Class II histocompatibility antigen, beta domain 15001 Class II histocompatibility antigen, alpha domain 15001 Class II histocompatibility antigen, alpha domain 15001 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 15001 Class II histocompatibility antigen, alpha domain 15001 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 15002 Class II histocompatibility antigen, beta domain 15006 Class I Histocompatibility antigen, domains alpha 1 and 2 15006 Class I Histocompatibility antigen, domains alpha 1 and 2 15006 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 15007 Class I Histocompatibility antigen, domains alpha 1 and 2 15007 Class I Histocompatibility antigen, domains alpha 1 and 2 15013 Class I Histocompatibility antigen, domains alpha 1 and 2 15016 Class I Histocompatibility antigen, domains alpha 1 and 2 15018 Class I Histocompatibility antigen, domains alpha 1 and 2 15019 Class I Histocompatibility antigen, domains alpha 1 and 2 15024 Class I Histocompatibility antigen, domains alpha 1 and 2 15039 Class I Histocompatibility antigen, domains alpha 1 and 2 15040 Class I Histocompatibility antigen, domains alpha 1 and 2 15042 Class I Histocompatibility antigen, domains alpha 1 and 2 15043 Class I Histocompatibility antigen, domains alpha 1 and 2 15051 Class I Histocompatibility antigen, domains alpha 1 and 2 15064 Class I Histocompatibility antigen, domains alpha 1 and 2 15108 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15109 Phenylalanine and histidine ammonia-lyase 15111 Helix-loop-helix DNA-binding domain 15112 FMN-dependent dehydrogenase 15117 Chitin synthase. Members of this family are fungal chitin synthase EC:2.4.1.16 enzymes. They catalyse chitin synthesis as follows: UDP-N-acetyl-D-glucosamine + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1 15118 Chitin synthase. Members of this family are fungal chitin synthase EC:2.4.1.16 enzymes. They catalyse chitin synthesis as follows: UDP-N-acetyl-D-glucosamine + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1 15122 Globin 15126 Globin 15129 Globin 15130 Globin 15132 Globin 15135 Globin 15139 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 15139 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 15139 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 15139 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 15159 Cytochrome c/c1 heme lyase 15160 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 15162 SH2 domain 15162 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 15163 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 15165 Cyclic nucleotide-binding domain 15165 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 15166 Cyclic nucleotide-binding domain 15166 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 15168 Cyclic nucleotide-binding domain 15168 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 15170 SH2 domain 15170 SH2 domain 15170 Protein-tyrosine phosphatase 15170 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 15170 SH2 domain 15170 Protein-tyrosine phosphatase 15170 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 15171 Prepro-orexin 15181 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 15182 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 15185 Zn-finger in ubiquitin-hydrolases and other protein 15186 Pyridoxal-dependent decarboxylase conserved domain 15191 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 15192 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 15193 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 15194 Huntingtin 15203 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 15203 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 15204 Anaphase-promoting complex, subunit 10 (APC10) 15204 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 15205 Helix-loop-helix DNA-binding domain 15206 Helix-loop-helix DNA-binding domain 15208 Helix-loop-helix DNA-binding domain 15209 Homeobox domain 15211 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold 15212 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold 15213 Helix-loop-helix DNA-binding domain 15214 Helix-loop-helix DNA-binding domain 15216 Class I Histocompatibility antigen, domains alpha 1 and 2 15221 Fork head domain 15223 Fork head domain 15223 Fork head domain 15228 Fork head domain 15229 Fork head domain 15233 homogentisate 1,2-dioxygenase. Homogentisate dioxygenase cleaves the aromatic ring during the metabolic degradation of Phe and Tyr. Homogentisate dioxygenase deficiency causes alkaptonuria. The structure of homogentisate dioxygenase shows that the enzyme 15234 Trypsin 15234 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 15234 PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some t 15235 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 15235 PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some 15239 VHS domain. Domain present in VPS-27, Hrs and STAM 15242 Homeobox domain 15248 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 15254 HIT family 15254 HIT family 15257 Protein kinase domain 15260 WD domain, G-beta repeat 15273 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 15275 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 15277 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 15284 Homeobox domain 15285 Homeobox domain 15288 Porphobilinogen deaminase, C-terminal domain 15288 Porphobilinogen deaminase, dipyromethane cofactor binding domain 15289 HMG (high mobility group) box 15353 HMG (high mobility group) box 15354 HMG (high mobility group) box 15356 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 15356 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 15357 Hydroxymethylglutaryl-coenzyme A reductase 15357 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 15360 Hydroxymethylglutaryl-coenzyme A synthase 15368 Heme oxygenase 15369 Heme oxygenase 15370 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 15370 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 15371 Homeobox domain 15372 Homeobox domain 15373 Homeobox domain 15373 Homeobox domain 15375 Fork head domain 15376 Fork head domain 15377 Fork head domain 15378 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 15378 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 15379 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 15388 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 15394 Homeobox domain 15395 Homeobox domain 15396 Homeobox domain 15398 Homeobox domain 15399 Homeobox domain 15402 Homeobox domain 15403 Homeobox domain 15404 Homeobox domain 15405 Homeobox domain 15407 Homeobox domain 15408 Homeobox domain 15410 Homeobox domain 15412 Homeobox domain 15413 Homeobox domain 15414 Homeobox domain 15415 Homeobox domain 15416 Homeobox domain 15416 Homeobox domain 15421 Homeobox domain 15422 Homeobox domain 15423 Homeobox domain 15424 Homeobox domain 15425 Homeobox domain 15425 Homeobox domain 15426 Homeobox domain 15427 Homeobox domain 15429 Homeobox domain 15429 Homeobox domain 15430 Homeobox domain 15431 Homeobox domain 15432 Homeobox domain 15433 Homeobox domain 15434 Homeobox domain 15436 Homeobox domain 15437 Homeobox domain 15438 Homeobox domain 15439 Sushi domain (SCR repeat) 15441 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 15442 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular 15446 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15450 Lipase 15450 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 15458 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 15460 jmjC domain 15463 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 15465 7 transmembrane receptor (rhodopsin family) 15465 7 transmembrane receptor (rhodopsin family) 15466 7 transmembrane receptor (rhodopsin family) 15467 Protein kinase domain 15473 Endoribonuclease L-PSP. Endoribonuclease active on single-stranded mRNA. Inhibits protein synthesis by cleavage of mRNA. Previously thought to inhibit protein synthesis initiation. This protein may also be involved in the regulation of purine biosynthesi 15476 Sulfotransferase protein 15478 Sulfotransferase protein 15481 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 15482 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 15482 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 15482 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 15482 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 15482 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 15483 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15484 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15485 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15486 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15486 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15487 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15488 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15488 MaoC like domain. The MaoC protein is found to share similarity with a wide variety of enzymes 15488 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 15490 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 15492 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 15493 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 15493 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 15494 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 15494 Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI) 15494 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 15495 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 15495 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 15495 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 15496 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 15497 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 15499 HSF-type DNA-binding 15499 HSF-type DNA-binding domain 15500 HSF-type DNA-binding 15505 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 15507 Hsp20/alpha crystallin family 15512 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 15516 Hsp90 protein 15519 Hsp90 protein 15519 Hsp90 protein 15519 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 15525 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 15526 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 15528 Chaperonin 10 Kd subunit 15529 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 15530 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 15530 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 15531 Sulfotransferase protein 15547 Conserved hypothetical protein 95 15547 ubiE/COQ5 methyltransferase family 15547 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 15547 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 15547 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 15547 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 15547 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 15547 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 15550 7 transmembrane receptor (rhodopsin family) 15551 7 transmembrane receptor (rhodopsin family) 15552 7 transmembrane receptor (rhodopsin family) 15557 7 transmembrane receptor (rhodopsin family) 15558 7 transmembrane receptor (rhodopsin family) 15559 7 transmembrane receptor (rhodopsin family) 15560 7 transmembrane receptor (rhodopsin family) 15561 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 15561 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 15562 7 transmembrane receptor (rhodopsin family) 15563 7 transmembrane receptor (rhodopsin family) 15564 7 transmembrane receptor (rhodopsin family) 15565 7 transmembrane receptor (rhodopsin family) 15566 7 transmembrane receptor (rhodopsin family) 15567 Sodium:neurotransmitter symporter family 15567 Serotonin (5-HT) neurotransmitter transporter, N-terminus 15571 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 15574 Hus1-like protein. Hus1, Rad1, and Rad9 are three evolutionarily conserved proteins required for checkpoint control in fission yeast. These proteins are known to form a stable complex in vivo. Hus1-Rad1-Rad9 complex may form a PCNA-like ring structure, a 15586 Hyaluronidase 15587 Hyaluronidase 15598 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic d 15894 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 15896 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 15898 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 15900 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 15901 Helix-loop-helix DNA-binding domain 15901 Helix-loop-helix DNA-binding domain 15902 Helix-loop-helix DNA-binding domain 15930 Indoleamine 2,3-dioxygenase 15931 Sulfatase 15932 Glycosyl hydrolases family 39 15932 Glycosyl hydrolases family 39 15945 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 15950 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 15950 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 15951 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 15951 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 15962 Interferon alpha/beta domain 15964 Interferon alpha/beta domain 15965 Interferon alpha/beta domain 15967 Interferon alpha/beta domain 15968 Interferon alpha/beta domain 15969 Interferon alpha/beta domain 15970 Interferon alpha/beta domain 15972 Interferon alpha/beta domain 15974 Interferon alpha/beta domain 15975 Tissue factor 15976 Tissue factor 15977 Interferon alpha/beta domain 15978 Interferon gamma 15978 Interferon gamma 15979 Tissue factor 15980 Tissue factor 15980 Fibronectin type III domain 16001 Protein kinase domain 16001 Fibronectin type III domain 16001 Furin-like cysteine rich region 16001 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 16004 Cation-independent mannose-6-phosphate receptor repeat. The cation-independent mannose-6-phosphate receptor contains 15 copies of a repeat 16005 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 16006 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 16007 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 16008 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 16009 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 16010 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 16011 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 16012 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 16061 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 16065 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 16098 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 16114 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 16117 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 16147 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 16147 Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation 16149 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 16150 Protein kinase domain 16153 Interleukin 10 16155 Tissue factor 16159 Interleukin-12 alpha subunit. Interleukin 12 (IL-12) is a disulphide-bonded heterodimer consisting of a 35kDa alpha subunit and a 40kDa beta subunit. It is involved in the stimulation and maintenance of Th1 cellular immune responses, including the normal 16161 Fibronectin type III domain 16162 Fibronectin type III domain 16168 Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses 16173 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 16174 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 16177 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 16179 Protein kinase domain 16180 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 16182 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 16183 Interleukin 2 16183 Interleukin 2 16183 Interleukin 2 16184 Sushi domain (SCR repeat) 16187 Interleukin-3 16189 Interleukin 4 16191 Interleukin 5 16197 Fibronectin type III domain 16198 Interleukin 7/9 family. IL-7 is a cytokine that acts as a growth factor for early lymphoid cells of both B- and T-cell lineages. IL-9 is a multi-functional cytokine that, although originally described as a T-cell growth factor, its function in T-cell res 16202 Protein kinase domain 16204 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 16210 Uncharacterized protein family UPF0029 16319 Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregati 16319 Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregatio 16322 Transforming growth factor beta like domain 16323 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 16329 Inositol monophosphatase family 16330 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 16330 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 16332 SH2 domain 16332 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 16333 Insulin/IGF/Relaxin family. Superfamily includes insulins 16334 Insulin/IGF/Relaxin family. Superfamily includes insulins 16336 Insulin/IGF/Relaxin family. Superfamily includes insulins 16337 Fibronectin type III domain 16337 Furin-like cysteine rich region 16337 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 16341 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 16348 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 16348 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 16362 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 16363 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 16364 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 16365 MmgE/PrpD family. This family includes 2-methylcitrate dehydratase EC:4.2.1.79 (PrpD) that is required for propionate catabolism. It catalyses the third step of the 2-methylcitric acid cycle 16367 PTB domain (IRS-1 type) 16369 PTB domain (IRS-1 type) 16370 PTB domain (IRS-1 type) 16371 Homeobox domain 16372 Homeobox domain 16391 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 16392 LIM domain. This family represents two copies of the LIM structural domain 16396 C2 domain 16396 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 16396 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 16398 von Willebrand factor type A domain 16402 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 16403 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 16407 von Willebrand factor type A domain 16407 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 16407 von Willebrand factor type A domain 16408 von Willebrand factor type A domain 16408 von Willebrand factor type A domain 16408 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 16409 von Willebrand factor type A domain 16411 von Willebrand factor type A domain 16412 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 16412 Integrins, beta chain. Sequences cut off at repeats due to overlap with EGF 16414 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 16415 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 16416 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 16418 eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues 16419 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 16420 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 16421 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 16428 SH2 domain 16428 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 16430 Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase 16431 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 16432 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 16438 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 16438 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16439 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 16449 Delta serrate ligand 16450 Delta serrate ligand 16468 jmjC domain 16468 jmjN domain 16468 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 16468 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 16469 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 16469 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 16476 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 16476 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 16477 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 16477 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 16478 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 16478 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 16480 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 16480 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 16485 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16485 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16490 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16490 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16491 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16491 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16492 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16492 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16493 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16493 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16494 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16494 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16495 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16495 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16499 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 16500 Kv2 voltage-gated K+ channel 16500 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16500 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16502 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16502 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16504 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16504 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16506 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16506 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16508 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16508 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16509 Slow voltage-gated potassium channel 16510 Cyclic nucleotide-binding domain 16510 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 16510 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16511 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 16511 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16512 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 16512 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16513 Inward rectifier potassium channel 16514 Inward rectifier potassium channel 16515 Inward rectifier potassium channel 16516 Inward rectifier potassium channel 16516 Inward rectifier potassium channel 16517 Inward rectifier potassium channel 16518 Inward rectifier potassium channel 16519 Inward rectifier potassium channel 16520 Inward rectifier potassium channel 16521 Inward rectifier potassium channel 16522 Inward rectifier potassium channel 16522 Inward rectifier potassium channel 16522 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16523 Inward rectifier potassium channel 16524 Inward rectifier potassium channel 16525 TASK K+ channel 16526 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16528 TASK K+ channel 16529 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16529 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16530 TASK K+ channel 16531 Calcium-activated BK potassium channel alpha subunit 16531 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16532 Calcium-activated BK potassium channel alpha subunit 16532 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16533 Calcium-activated potassium channel, beta subunit 16534 Calcium-activated SK potassium channel 16534 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 16535 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 16535 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16536 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16538 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16538 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16539 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16539 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16539 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 16539 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 16541 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 16548 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es 16549 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 16549 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 16552 Kinesin motor domain 16552 Kinesin motor domain 16559 Kinesin motor domain 16560 Kinesin motor domain 16560 PH domain. PH stands for pleckstrin homology 16560 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 16561 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 16561 Kinesin motor domain 16561 PH domain. PH stands for pleckstrin homology 16561 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 16562 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 16564 Kinesin motor domain 16568 Kinesin motor domain 16568 Kinesin motor domain 16569 Kinesin motor domain 16570 Kinesin motor domain 16572 Kinesin motor domain 16573 Kinesin motor domain 16574 Kinesin motor domain 16578 Kinesin motor domain 16578 Kinesin motor domain 16589 Protein kinase domain 16590 Protein kinase domain 16590 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 16591 Glycosyl hydrolase family 1 16592 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 16618 Trypsin 16625 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 16644 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 16656 Zinc finger, C2H2 type 16658 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 16660 Intermediate filament protein 16661 Intermediate filament protein 16663 Intermediate filament protein 16664 Intermediate filament protein 16665 Intermediate filament protein 16666 Intermediate filament protein 16667 Intermediate filament protein 16668 Intermediate filament protein 16669 Intermediate filament protein 16670 Intermediate filament protein 16671 Intermediate filament protein 16671 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 16672 Intermediate filament protein 16673 Intermediate filament protein 16675 Intermediate filament protein 16678 Intermediate filament protein 16679 Intermediate filament protein 16679 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 16680 Intermediate filament protein 16680 Intermediate filament protein 16681 Intermediate filament protein 16682 Intermediate filament protein 16687 Intermediate filament protein 16688 Intermediate filament protein 16691 Intermediate filament protein 16706 Protein kinase domain 16728 Fibronectin type III domain 16765 Stathmin family 16769 Cadherin domain 16769 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 16770 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme 16772 Laminin G domain 16772 Laminin B (Domain IV) 16772 Laminin N-terminal (Domain VI) 16772 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16773 Laminin G domain 16773 Laminin N-terminal (Domain VI) 16773 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16773 Tropomyosin 16773 Laminin G domain 16773 Laminin B (Domain IV) 16773 Uncharacterized ACR, COG1579 16773 Intermediate filament protein 16773 Laminin N-terminal (Domain VI) 16773 Giardia variant-specific surface protein 16773 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16773 Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production 16773 Methyl-accepting chemotaxis protein (MCP) signaling domain. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs 16773 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 16773 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 16773 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 16773 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 16773 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 16773 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 16774 Laminin B (Domain IV) 16774 Ribosomal protein L15 16774 Laminin N-terminal (Domain VI) 16774 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16775 Intermediate filament protein 16775 Thrombospondin N-terminal -like domain 16775 Giardia variant-specific surface protein 16775 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16775 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 16776 Laminin B (Domain IV) 16776 HlyD family secretion protein 16776 Intermediate filament protein 16776 Laminin N-terminal (Domain VI) 16776 Borrelia outer surface protein D (OspD) 16776 Giardia variant-specific surface protein 16776 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16776 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 16776 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E 16776 ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a 16776 Rotavirus major capsid protein VP6. Rotaviruses consist of three concentric protein shells. The intermediate (middle) protein layer consists 260 trimers of VP6. VP6 in the most abundant protein in the virion. VP6 is also involved in virion assembly, and p 16776 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 16779 Laminin N-terminal (Domain VI) 16779 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16780 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16780 Laminin N-terminal (Domain VI) 16780 Giardia variant-specific surface protein 16780 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16782 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 16783 Lysosome-associated membrane glycoprotein (Lamp) 16784 Lysosome-associated membrane glycoprotein (Lamp) 16785 Ribosomal protein S2 16790 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 16792 Golgi 4-transmembrane spanning transporter 16795 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 16796 LIM domain. This family represents two copies of the LIM structural domain 16798 Protein kinase domain 16798 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 16803 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 16814 Homeobox domain 16815 Homeobox domain 16816 Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol 16818 SH2 domain 16818 Protein kinase domain 16818 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 16819 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 16820 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 16821 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 16825 LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located i 16826 LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located i 16828 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 16828 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 16835 Low-density lipoprotein receptor domain class A 16835 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 16840 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 16842 HMG (high mobility group) box 16846 Leptin 16848 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 16852 Galactoside-binding lectin 16854 Galactoside-binding lectin 16855 Galactoside-binding lectin 16867 7 transmembrane receptor (rhodopsin family) 16869 Homeobox domain 16869 LIM domain. This family represents two copies of the LIM structural domain 16870 Homeobox domain 16870 LIM domain. This family represents two copies of the LIM structural domain 16871 Homeobox domain 16871 LIM domain. This family represents two copies of the LIM structural domain 16872 Homeobox domain 16872 LIM domain. This family represents two copies of the LIM structural domain 16873 Homeobox domain 16873 LIM domain. This family represents two copies of the LIM structural domain 16874 Homeobox domain 16874 LIM domain. This family represents two copies of the LIM structural domain 16875 Homeobox domain 16875 LIM domain. This family represents two copies of the LIM structural domain 16876 Homeobox domain 16876 LIM domain. This family represents two copies of the LIM structural domain 16878 LIF / OSM family 16880 Fibronectin type III domain 16882 ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653 16882 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 16882 Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc 16885 Protein kinase domain 16885 LIM domain. This family represents two copies of the LIM structural domain 16885 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 16886 LIM domain. This family represents two copies of the LIM structural domain 16886 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 16886 Protein kinase domain 16886 LIM domain. This family represents two copies of the LIM structural domain 16886 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 16889 ab-hydrolase associated lipase region 16891 Lipase 16891 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 16905 Intermediate filament protein 16905 Intermediate filament tail domain 16906 Intermediate filament protein 16906 Intermediate filament tail domain 16907 Intermediate filament protein 16907 Intermediate filament tail domain 16909 LIM domain. This family represents two copies of the LIM structural domain 16911 LIM domain. This family represents two copies of the LIM structural domain 16912 Proteasome A-type and B-type 16913 Proteasome A-type and B-type 16913 Proteasome A-type and B-type 16917 Homeobox domain 16917 LIM domain. This family represents two copies of the LIM structural domain 16918 Helix-loop-helix DNA-binding domain 16918 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 16948 Lysyl oxidase 16950 Lysyl oxidase 16952 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 16956 Lipase 16956 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 16956 Lipase 16956 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 16969 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 16971 Low-density lipoprotein receptor domain class A 16971 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 16973 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 16974 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 16975 Low-density lipoprotein receptor domain class A 16975 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 16980 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 16980 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 16981 Fibronectin type III domain 16981 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 16985 Caldesmon 16992 TNF(Tumor Necrosis Factor) family 16993 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 16994 TNF(Tumor Necrosis Factor) family 16995 7 transmembrane receptor (rhodopsin family) 16997 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 16998 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 17000 TNFR/NGFR cysteine-rich region 17001 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 17002 Transferrin 17025 Delta-aminolevulinic acid dehydratase 17057 Lectin C-type domain. This family includes both long and short form C-type 17058 Lectin C-type domain. This family includes both long and short form C-type 17063 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 17068 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 17069 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 17075 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 17076 Lectin C-type domain. This family includes both long and short form C-type 17076 Lectin C-type domain. This family includes both long and short form C-type 17079 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 17079 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 17083 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 17083 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 17084 ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r 17087 ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r 17101 Beige/BEACH domain 17113 Cation-dependent mannose-6-phosphate receptor 17116 Mab-21 protein 17119 Helix-loop-helix DNA-binding domain 17125 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 17125 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 17126 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 17126 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 17127 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 17127 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 17128 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 17128 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 17129 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 17129 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 17131 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 17131 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 17133 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 17134 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 17135 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 17137 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17138 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17139 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17139 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 17140 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17141 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17142 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17144 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17145 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17146 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17147 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17149 Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the 17155 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 17156 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 17156 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 17158 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 17159 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 17160 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 17161 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 17164 Protein kinase domain 17167 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 17168 Methylpurine-DNA glycosylase (MPG). Methylpurine-DNA glycosylase is a base excision-repair protein. It is responsible for the hydrolysis of the deoxyribose N-glycosidic bond, excising 3-methyladenine and 3-methylguanine from damaged DNA 17169 Kinase associated domain 1 17169 Kinase associated domain 1 17171 7 transmembrane receptor (rhodopsin family) 17171 7 transmembrane receptor (rhodopsin family) 17174 Trypsin 17174 CUB domain 17174 Trypsin 17174 CUB domain 17174 Sushi domain (SCR repeat) 17178 ATP1G1/PLM/MAT8 family 17180 von Willebrand factor type A domain 17181 von Willebrand factor type A domain 17181 von Willebrand factor type A domain 17182 von Willebrand factor type A domain 17183 von Willebrand factor type A domain 17187 Helix-loop-helix DNA-binding domain 17189 Globin 17190 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 17191 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has 17194 Lectin C-type domain. This family includes both long and short form C-type 17195 Lectin C-type domain. This family includes both long and short form C-type 17196 Myelin basic protein 17199 7 transmembrane receptor (rhodopsin family) 17200 7 transmembrane receptor (rhodopsin family) 17201 7 transmembrane receptor (rhodopsin family) 17202 7 transmembrane receptor (rhodopsin family) 17203 7 transmembrane receptor (rhodopsin family) 17207 PH domain. PH stands for pleckstrin homology 17207 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 17207 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 17207 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 17210 Apoptosis regulator proteins, Bcl-2 family 17215 MCM2/3/5 family 17219 MCM2/3/5 family 17220 MCM2/3/5 family 17228 Trypsin 17231 Trypsin 17232 Trypsin 17242 PTN/MK heparin-binding protein family 17246 p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do 17248 p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do 17252 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 17258 SRF-type transcription factor (DNA-binding and dimerisation domain) 17259 SRF-type transcription factor (DNA-binding and dimerisation domain) 17260 SRF-type transcription factor (DNA-binding and dimerisation domain) 17261 SRF-type transcription factor (DNA-binding and dimerisation domain) 17274 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 17276 ENV polyprotein (coat polyprotein) 17279 Kinase associated domain 1 17281 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 17281 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 17281 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 17281 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 17285 Homeobox domain 17285 Homeobox domain 17286 Homeobox domain 17286 Homeobox domain 17287 MAM domain. An extracellular domain found in many receptors 17287 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 17287 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 17287 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 17288 MAM domain. An extracellular domain found in many receptors 17288 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 17288 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 17288 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 17289 Fibronectin type III domain 17292 Helix-loop-helix DNA-binding domain 17292 Helix-loop-helix DNA-binding domain 17293 Helix-loop-helix DNA-binding domain 17295 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 17295 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 17295 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 17299 Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity 17300 Fork head domain 17301 Fork head domain 17304 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 17304 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 17305 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 17306 Synaptophysin / synaptoporin 17308 GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o 17311 Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site 17312 Hepatic lectin, N-terminal domain 17319 Macrophage migration inhibitory factor (MIF) 17329 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 17329 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 17329 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 17330 Histidine acid phosphatase 17339 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 17341 Helix-loop-helix DNA-binding domain 17342 Helix-loop-helix DNA-binding domain 17344 pfam02891, zf-MIZ, MIZ zinc finger 17344 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 17381 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17381 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17381 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17384 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17384 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17384 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17385 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17385 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17386 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17386 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17386 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17387 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17387 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17387 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17388 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17388 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17388 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17389 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17389 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17389 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17390 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17390 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17390 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17391 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17391 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17391 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17392 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17392 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17392 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17393 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17393 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17394 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17394 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17394 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 17395 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 17395 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 17423 Sulfotransferase protein 17425 Fork head domain 17425 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 17427 V-type ATPase 116kDa subunit family 17428 Helix-loop-helix DNA-binding domain 17434 MoaE protein. This family contains the MoaE protein that is involved in biosynthesis of molybdopterin. Molybdopterin, the universal component of the pterin molybdenum cofactors, contains a dithiolene group serving to bind Mo. Addition of the dithiolene s 17436 Malic enzyme, N-terminal domain 17441 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 17444 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 17444 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 17451 Protein kinase domain 17463 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 17472 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 17472 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 17472 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 17472 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 17523 Animal haem peroxidase 17524 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 17527 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 17533 Lectin C-type domain. This family includes both long and short form C-type 17534 Lectin C-type domain. This family includes both long and short form C-type 17535 Mer11 DNA-binding domain. The Mre11 complex is a multisubunit nuclease that is composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved in checkpoint signalling and DNA replication. Mre11 has an intrinsic DNA-binding activity that is stimulated by Rad50 o 17686 MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of 17687 DNA mismatch repair proteins, mutS family 17687 MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p 17687 DNA mismatch repair proteins, mutS family 17687 MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p 17688 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 17698 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 17698 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 17700 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 17701 Homeobox domain 17702 Homeobox domain 17703 Homeobox domain 17713 GrpE 17714 GrpE 17714 GrpE 17749 DNA directed RNA polymerase, 7 kDa subunit 17755 Presenilin 17756 Tau and MAP protein, tubulin-binding repeat 17758 Tau and MAP protein, tubulin-binding repeat 17762 Tau and MAP protein, tubulin-binding repeat 17765 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 17766 NUDIX domain 17768 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 17769 Methylenetetrahydrofolate reductase. This family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from bacteria and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The structure for this domain is known to be a TIM barr 17771 Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in 17772 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 17773 7 transmembrane receptor (rhodopsin family) 17773 7 transmembrane receptor (rhodopsin family) 17775 Golgi 4-transmembrane spanning transporter 17775 Golgi 4-transmembrane spanning transporter 17777 Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitell 17829 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 17831 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 17831 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerbe 17832 B-box zinc finger 17832 Zinc finger, C3HC4 type (RING finger) 17832 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 17833 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 17836 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 17836 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 17837 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 17837 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 17840 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 17842 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 17843 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 17844 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 17850 Methylmalonyl-CoA mutase. The enzyme methylmalonyl-CoA mutase is a member of a class of enzymes that uses coenzyme B12 (adenosylcobalamin) as a cofactor. The enzyme induces the formation of an adenosyl radical from the cofactor. This radical then initiat 17857 Dynamin GTPase effector domain 17857 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 17858 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 17859 Helix-loop-helix DNA-binding domain 17864 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 17869 Helix-loop-helix DNA-binding domain 17869 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 17870 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 17873 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 17874 Death domain 17874 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 17877 Helix-loop-helix DNA-binding domain 17877 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 17877 Helix-loop-helix DNA-binding domain 17877 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates s 17878 Helix-loop-helix DNA-binding domain 17878 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 17880 Myosin head (motor domain) 17880 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 17880 Intermediate filament protein 17880 Myosin head (motor domain) 17880 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 17880 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 17880 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 17880 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 17880 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 17882 Myosin head (motor domain) 17884 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 17886 Myosin head (motor domain) 17886 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 17886 Myosin head (motor domain) 17888 Myosin head (motor domain) 17888 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 17909 PH domain. PH stands for pleckstrin homology 17909 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 17909 Myosin head (motor domain) 17909 PH domain. PH stands for pleckstrin homology 17909 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 17909 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 17918 Myosin head (motor domain) 17918 DIL domain. The DIL domain has no known function 17919 DIL domain. The DIL domain has no known function 17922 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 17922 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 17925 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 17927 Helix-loop-helix DNA-binding domain 17927 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 17928 Helix-loop-helix DNA-binding domain 17928 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 17930 Fibronectin type III domain 17931 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 17931 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 17932 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 17933 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 17939 Melibiase 17952 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 17954 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 17955 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 17960 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation 17961 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation 17962 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation 17965 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 17967 Fibronectin type III domain 17968 Fibronectin type III domain 17969 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 17970 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 17973 SH2 domain 17973 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 17974 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 17978 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 17984 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 17988 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 17990 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 17996 Nebulin repeat 17999 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 18000 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 18001 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 18001 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 18001 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 18002 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 18007 Fibronectin type III domain 18008 Intermediate filament protein 18012 Helix-loop-helix DNA-binding domain 18012 Helix-loop-helix DNA-binding domain 18013 Helix-loop-helix DNA-binding domain 18014 Helix-loop-helix DNA-binding domain 18016 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 18016 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 18021 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 18027 CTF/NF-I family 18027 CTF/NF-I family 18028 CTF/NF-I family 18029 CTF/NF-I family 18032 CTF/NF-I family 18033 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 18034 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 18039 Intermediate filament protein 18039 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 18040 Intermediate filament protein 18041 Aminotransferase class-V 18041 Pyridoxal-dependent decarboxylase conserved domain 18041 DegT/DnrJ/EryC1/StrS aminotransferase family. The members of this family are probably all pyridoxal-phosphate-dependent aminotransferase enzymes with a variety of molecular functions. The family includes StsA, StsC and StsS. The aminotransferase activity 18045 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 18048 Trypsin 18049 Nerve growth factor family 18050 Trypsin 18053 TNFR/NGFR cysteine-rich region 18054 Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins 18073 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 18073 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 18073 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 18073 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 18074 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 18087 Cyclophilin type peptidyl-prolyl cis-trans isomerase 18088 Homeobox domain 18089 Homeobox domain 18091 Homeobox domain 18092 Homeobox domain 18094 Homeobox domain 18095 Homeobox domain 18096 Homeobox domain 18101 7 transmembrane receptor (rhodopsin family) 18102 Nucleoside diphosphate kinase 18103 Nucleoside diphosphate kinase 18104 Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones 18105 Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones 18109 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 18113 NNMT/PNMT/TEMT family 18115 NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide 18117 Uncharacterised protein family (UPF0172) 18124 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 18124 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 18125 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 18125 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 18126 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 18127 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 18128 Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved 18130 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 18131 Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved 18140 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 18140 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 18140 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 18141 RanBP1 domain 18145 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 18148 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 18150 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 18155 Vertebrate endogenous opioids neuropeptide 18155 Vertebrate endogenous opioids neuropeptide 18158 Atrial natriuretic peptide 18159 Atrial natriuretic peptide 18160 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 18162 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 18166 7 transmembrane receptor (rhodopsin family) 18167 7 transmembrane receptor (rhodopsin family) 18168 7 transmembrane receptor (rhodopsin family) 18169 7 transmembrane receptor (rhodopsin family) 18171 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 18171 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 18171 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 18171 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 18173 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined 18174 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined 18175 Nebulin repeat 18183 Neuregulin family 18185 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 18186 CUB domain 18186 MAM domain. An extracellular domain found in many receptors 18186 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 18187 CUB domain 18187 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 18187 CUB domain 18187 MAM domain. An extracellular domain found in many receptors 18187 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 18188 Transforming growth factor beta like domain 18189 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 18189 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 18191 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 18193 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 18193 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 18194 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 18195 Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o 18198 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 18201 Beige/BEACH domain 18205 Nerve growth factor family 18208 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 18209 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 18211 Protein kinase domain 18214 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 18216 7 transmembrane receptor (rhodopsin family) 18217 7 transmembrane receptor (rhodopsin family) 18217 7 transmembrane receptor (rhodopsin family) 18218 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 18221 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 18222 Phosphotyrosine interaction domain (PTB/PID) 18223 Phosphotyrosine interaction domain (PTB/PID) 18227 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 18227 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 18241 Ocular albinism type 1 protein 18241 Ocular albinism type 1 protein 18241 7 transmembrane receptor (Secretin family) 18241 Ocular albinism type 1 protein 18241 7 transmembrane receptor (Secretin family) 18245 Ornithine decarboxylase antizyme 18247 Ornithine decarboxylase antizyme 18249 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 18256 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t 18260 Occludin/ELL family 18263 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 18291 Homeobox domain 18292 Homeobox domain 18293 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 18293 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 18301 ATP1G1/PLM/MAT8 family 18302 Zona pellucida-like domain 18310 7 transmembrane receptor (rhodopsin family) 18312 7 transmembrane receptor (rhodopsin family) 18313 7 transmembrane receptor (rhodopsin family) 18314 7 transmembrane receptor (rhodopsin family) 18315 7 transmembrane receptor (rhodopsin family) 18315 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 18316 7 transmembrane receptor (rhodopsin family) 18318 7 transmembrane receptor (rhodopsin family) 18320 7 transmembrane receptor (rhodopsin family) 18321 7 transmembrane receptor (rhodopsin family) 18322 7 transmembrane receptor (rhodopsin family) 18322 7 transmembrane receptor (rhodopsin family) 18323 7 transmembrane receptor (rhodopsin family) 18324 7 transmembrane receptor (rhodopsin family) 18325 7 transmembrane receptor (rhodopsin family) 18329 7 transmembrane receptor (rhodopsin family) 18330 7 transmembrane receptor (rhodopsin family) 18331 7 transmembrane receptor (rhodopsin family) 18332 7 transmembrane receptor (rhodopsin family) 18340 7 transmembrane receptor (rhodopsin family) 18341 7 transmembrane receptor (rhodopsin family) 18342 7 transmembrane receptor (rhodopsin family) 18344 7 transmembrane receptor (rhodopsin family) 18345 7 transmembrane receptor (rhodopsin family) 18346 7 transmembrane receptor (rhodopsin family) 18347 7 transmembrane receptor (rhodopsin family) 18348 7 transmembrane receptor (rhodopsin family) 18349 7 transmembrane receptor (rhodopsin family) 18350 7 transmembrane receptor (rhodopsin family) 18351 7 transmembrane receptor (rhodopsin family) 18352 7 transmembrane receptor (rhodopsin family) 18354 7 transmembrane receptor (rhodopsin family) 18355 7 transmembrane receptor (rhodopsin family) 18356 7 transmembrane receptor (rhodopsin family) 18357 7 transmembrane receptor (rhodopsin family) 18357 7 transmembrane receptor (rhodopsin family) 18358 7 transmembrane receptor (rhodopsin family) 18359 7 transmembrane receptor (rhodopsin family) 18361 7 transmembrane receptor (rhodopsin family) 18362 7 transmembrane receptor (rhodopsin family) 18363 7 transmembrane receptor (rhodopsin family) 18365 7 transmembrane receptor (rhodopsin family) 18366 7 transmembrane receptor (rhodopsin family) 18367 7 transmembrane receptor (rhodopsin family) 18368 7 transmembrane receptor (rhodopsin family) 18369 7 transmembrane receptor (rhodopsin family) 18370 7 transmembrane receptor (rhodopsin family) 18373 7 transmembrane receptor (rhodopsin family) 18386 7 transmembrane receptor (rhodopsin family) 18387 7 transmembrane receptor (rhodopsin family) 18389 7 transmembrane receptor (rhodopsin family) 18390 7 transmembrane receptor (rhodopsin family) 18393 Origin recognition complex subunit 2. All DNA replication initiation is driven by a single conserved eukaryotic initiator complex termed he origin recognition complex (ORC). The ORC is a six protein complex. The function of ORC is reviewed in 18399 Sugar (and other) transporter 18405 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 18406 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 18408 Mitochondrial carrier protein 18408 Mitochondrial carrier protein 18413 LIF / OSM family 18414 Fibronectin type III domain 18415 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 18417 PMP-22/EMP/MP20/Claudin family 18419 von Willebrand factor type D domain 18420 Homeobox domain 18423 Homeobox domain 18423 Otx1 transcription factor 18424 Homeobox domain 18424 Otx1 transcription factor 18429 Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5 18430 7 transmembrane receptor (rhodopsin family) 18436 ATP P2X receptor 18438 ATP P2X receptor 18439 ATP P2X receptor 18440 ATP P2X receptor 18441 7 transmembrane receptor (rhodopsin family) 18442 7 transmembrane receptor (rhodopsin family) 18451 2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2 18453 Calsequestrin 18453 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 18458 Poly-adenylate binding protein, unique domain 18475 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu 18476 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu 18478 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 18479 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 18481 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 18483 Paralemmin 18484 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 18484 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 18488 Fibronectin type III domain 18489 Lectin C-type domain. This family includes both long and short form C-type 18491 Sushi domain (SCR repeat) 18503 'Paired box' domain 18505 Homeobox domain 18505 'Paired box' domain 18506 Homeobox domain 18506 'Paired box' domain 18507 'Paired box' domain 18508 Homeobox domain 18508 'Paired box' domain 18509 Homeobox domain 18509 'Paired box' domain 18510 'Paired box' domain 18511 'Paired box' domain 18514 Homeobox domain 18514 PBX domain. The PBX domain is a bipartite acidic domain 18515 Homeobox domain 18515 PBX domain. The PBX domain is a bipartite acidic domain 18516 Homeobox domain 18516 PBX domain. The PBX domain is a bipartite acidic domain 18518 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 18521 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 18530 Cadherin domain 18534 Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate 18537 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 18538 Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 18542 CUB domain 18548 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic 18548 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 18549 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 18550 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 18551 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic 18551 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 18551 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont 18552 Furin-like cysteine rich region 18552 Giardia variant-specific surface protein 18552 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic d 18552 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont 18553 Furin-like cysteine rich region 18553 Giardia variant-specific surface protein 18553 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic d 18553 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont 18554 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 18567 MYND finger 18567 Programmed cell death protein 2, C-terminal domain 18571 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 18573 3'5'-cyclic nucleotide phosphodiesterase 18574 3'5'-cyclic nucleotide phosphodiesterase 18575 3'5'-cyclic nucleotide phosphodiesterase 18576 3'5'-cyclic nucleotide phosphodiesterase 18577 3'5'-cyclic nucleotide phosphodiesterase 18578 3'5'-cyclic nucleotide phosphodiesterase 18583 3'5'-cyclic nucleotide phosphodiesterase 18584 3'5'-cyclic nucleotide phosphodiesterase 18585 3'5'-cyclic nucleotide phosphodiesterase 18587 3'5'-cyclic nucleotide phosphodiesterase 18590 Platelet-derived growth factor (PDGF) 18591 Platelet-derived growth factor (PDGF) 18595 Protein kinase domain 18595 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 18595 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 18597 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 18598 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 18599 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 18600 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 18601 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 18602 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 18604 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 18605 Somatomedin B domain 18605 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 18606 Somatomedin B domain 18606 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 18609 Homeobox domain 18610 Vertebrate endogenous opioids neuropeptide 18611 Death effector domain 18612 Ets-domain 18616 Actinobacillus constitutively-expressed outer membrane lipoprotein A 18616 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 18618 Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids 18624 metallopeptidase family M24 18630 WD domain, G-beta repeat 18636 Thrombospondin type 1 domain 18637 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 18639 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 18640 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 18640 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 18640 Phosphoglycerate mutase family 18640 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 18640 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme contai 18641 Phosphofructokinase 18642 Phosphofructokinase 18646 C2 domain 18646 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 18655 Phosphoglycerate kinase 18663 Phosphoglycerate kinase 18667 Progesterone receptor 18667 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 18673 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 18674 Mitochondrial carrier protein 18676 jmjC domain 18676 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 18700 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 18704 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 18704 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 18706 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 18707 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 18707 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 18708 SH2 domain 18708 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 18708 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 18710 SH2 domain 18710 SH2 domain 18711 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 18712 Protein kinase domain 18718 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata 18719 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata 18738 Phosphatidylinositol transfer protein 18738 Phosphatidylinositol transfer protein 18739 Phosphatidylinositol transfer protein 18739 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester 18740 Homeobox domain 18741 Homeobox domain 18742 OAR domain 18742 Homeobox domain 18750 C2 domain 18750 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18751 C2 domain 18751 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18752 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18753 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18754 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18755 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18759 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18760 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18761 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18761 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18762 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 18763 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 18763 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 18763 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 18763 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c 18764 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 18766 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 18766 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 18766 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c 18767 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 18768 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 18768 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 18769 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 18770 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 18775 Somatotropin hormone family 18776 Somatotropin hormone family 18777 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 18778 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 18780 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 18782 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 18787 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 18791 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 18792 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 18795 C2 domain 18795 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 18795 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 18797 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 18797 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 18798 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 18798 C2 domain 18798 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 18798 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 18798 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 18799 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 18802 PH domain. PH stands for pleckstrin homology 18803 C2 domain 18803 SH2 domain 18803 PH domain. PH stands for pleckstrin homology 18803 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 18803 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 18803 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 18803 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 18805 PH domain. PH stands for pleckstrin homology 18805 PX domain. PX domains bind to phosphoinositides 18805 Phospholipase D. Active site motif. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certa 18806 PX domain. PX domains bind to phosphoinositides 18810 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 18810 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 18811 Somatotropin hormone family 18811 Somatotropin hormone family 18812 Somatotropin hormone family 18813 metallopeptidase family M24 18813 metallopeptidase family M24 18814 Somatotropin hormone family 18815 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 18817 POLO box duplicated region 18821 Phospholamban. The regulation of calcium levels across the membrane of the sarcoplasmic reticulum involves the interplay of many membrane proteins. Phospholamban is a 52 residue integral membrane protein that is involved in reversibly inhibiting the Ca(2 18823 Myelin proteolipid protein (PLP or lipophilin) 18828 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 18844 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 18844 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 18845 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 18845 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 18845 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans prot 18845 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 18846 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 18846 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 18858 PMP-22/EMP/MP20/Claudin family 18933 Homeobox domain 18935 Homeobox domain 18946 Lipase 18946 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 18947 Lipase 18948 NNMT/PNMT/TEMT family 18950 Phosphorylase family 2 18952 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 18968 DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities 18969 DNA polymerase alpha subunit B. The B subunit of the DNA polymerase alpha plays an essential role at the initial stage of DNA replication in S. cerevisiae and is phosphorylated in a cell cycle-dependent manner 18974 DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme con 18974 DNA polymerase alpha subunit B. The B subunit of the DNA polymerase alpha plays an essential role at the initial stage of DNA replication in S. cerevisiae and is phosphorylated in a cell cycle-dependent manner 18974 DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme cons 18975 DNA polymerase family A 18976 Corticotropin ACTH domain 18979 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. 18980 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. 18986 Homeobox domain 18987 Homeobox domain 18988 Homeobox domain 18996 Homeobox domain 18996 Pou domain - N-terminal to homeobox domain 18997 Pou domain - N-terminal to homeobox domain 18998 Pou domain - N-terminal to homeobox domain 18999 Pou domain - N-terminal to homeobox domain 19011 Somatomedin B domain 19012 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 19013 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 19013 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 19015 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 19015 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 19016 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 19016 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 19024 pfam02920, integrase_DNA, DNA binding domain of tn916 integrase 19024 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 19025 Serine carboxypeptidase 19027 Synaptophysin / synaptoporin 19035 Cyclophilin type peptidyl-prolyl cis-trans isomerase 19035 Cyclophilin type peptidyl-prolyl cis-trans isomerase 19038 Cyclophilin type peptidyl-prolyl cis-trans isomerase 19043 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 19047 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 19055 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 19056 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 19060 TPR Domain 19060 Uncharacterised protein family (UPF0169). Members of this family are predicted to be lipoproteins. The function of these proteins is unknown 19060 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 19062 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 19063 Palmitoyl protein thioesterase 19064 Pancreatic hormone peptide 19065 7 transmembrane receptor (rhodopsin family) 19070 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 19075 DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits 19076 Eukaryotic-type DNA primase, large subunit. DNA primase is the polymerase that synthesises small RNA primers for the Okazaki fragments made during discontinuous DNA replication. DNA primase is a heterodimer of two subunits, the small subunit Pri1 (48 kDa 19079 pfam02922, isoamylase_N, Isoamylase N-terminal domain. This domain is found in a range of enzymes that act on branched substrates - isoamylase, pullulanase and branching enzyme. This family also contains the beta subunit of 5' AMP activated kinase 19084 Cyclic nucleotide-binding domain 19084 Cyclic nucleotide-binding domain 19084 Regulatory subunit of type II PKA R-subunit 19085 Cyclic nucleotide-binding domain 19087 Cyclic nucleotide-binding domain 19088 Cyclic nucleotide-binding domain 19088 Regulatory subunit of type II PKA R-subunit 19090 Phosphatidylinositol 3- and 4-kinase 19090 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 19109 Somatotropin hormone family 19110 Somatotropin hormone family 19111 Somatotropin hormone family 19112 Somatotropin hormone family 19113 Somatotropin hormone family 19114 Somatotropin hormone family 19123 Trypsin 19123 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 19125 Proline dehydrogenase 19127 Homeobox domain 19128 Laminin G domain 19128 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 19132 Intermediate filament protein 19133 Tetraspanin family 19141 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this 19146 CUB domain 19146 MAM domain. An extracellular domain found in many receptors 19146 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 19146 CUB domain 19146 MAM domain. An extracellular domain found in many receptors 19146 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 19155 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 19156 Saposin A-type domain 19156 Saposin A-type domain 19157 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 19158 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 19159 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 19164 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 19165 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 19170 Proteasome A-type and B-type 19172 Proteasome A-type and B-type 19181 Sigma-54 interaction domain 19181 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 19181 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 19186 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 19186 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 19188 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 19188 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 19191 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 19191 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 19192 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 19192 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 19197 Transforming growth factor beta like domain 19200 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 19200 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 19201 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 19202 Homeobox domain 19205 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 19206 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 19207 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 19207 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 19207 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 19210 Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phos 19212 Phosphotriesterase family 19213 Helix-loop-helix DNA-binding domain 19214 7 transmembrane receptor (rhodopsin family) 19215 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 19216 7 transmembrane receptor (rhodopsin family) 19217 7 transmembrane receptor (rhodopsin family) 19218 7 transmembrane receptor (rhodopsin family) 19219 7 transmembrane receptor (rhodopsin family) 19220 7 transmembrane receptor (rhodopsin family) 19222 7 transmembrane receptor (rhodopsin family) 19223 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 19224 Animal haem peroxidase 19224 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 19225 Animal haem peroxidase 19225 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 19225 Animal haem peroxidase 19225 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 19225 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 19226 Parathyroid hormone family 19227 Parathyroid hormone family 19228 7 transmembrane receptor (Secretin family) 19229 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 19229 Protein kinase domain 19229 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion tar 19242 PTN/MK heparin-binding protein family 19243 Protein-tyrosine phosphatase 19243 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 19247 SH2 domain 19252 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 19258 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 19258 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 19260 Paramyxovirus P phosphoprotein. This family consists of paramyxovirus P phosphoprotein from sendai virus and human and bovine parainfluenza viruses. The P protein is an essential part of the viral RNA polymerase complex formed form the P and L proteins. 19261 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 19263 Fibronectin type III domain 19266 Fibronectin type III domain 19266 Protein-tyrosine phosphatase 19266 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 19268 Fibronectin type III domain 19270 Eukaryotic-type carbonic anhydrase 19271 Fibronectin type III domain 19272 Fibronectin type III domain 19273 Fibronectin type III domain 19274 Fibronectin type III domain 19280 Fibronectin type III domain 19280 Fibronectin type III domain 19281 Fibronectin type III domain 19283 Fibronectin type III domain 19283 Eukaryotic-type carbonic anhydrase 19283 Fibronectin type III domain 19283 Eukaryotic-type carbonic anhydrase 19283 Protein-tyrosine phosphatase 19283 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 19286 6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. 19289 Fibronectin type III domain 19293 EF hand 19301 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 19302 Zinc finger, C3HC4 type (RING finger) 19303 Paxillin family 19303 LIM domain. This family represents two copies of the LIM structural domain 19303 Paxillin family 19303 LIM domain. This family represents two copies of the LIM structural domain 19309 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 19309 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 19324 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19325 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19328 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19329 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19331 ADP-ribosylation factor family 19331 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19332 ADP-ribosylation factor family 19332 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19334 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19335 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19336 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19344 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19344 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19345 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19345 ADP-ribosylation factor family 19345 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19347 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 19347 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 19348 Kinesin motor domain 19353 ADP-ribosylation factor family 19353 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 19355 Repair protein Rad1/Rec1/Rad17 19356 Rad17 cell cycle checkpoint protein 19358 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 19359 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 19359 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 19365 Rad52/22 family double-strand break repair protein 19367 Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage 19368 Class I Histocompatibility antigen, domains alpha 1 and 2 19369 Class I Histocompatibility antigen, domains alpha 1 and 2 19370 Class I Histocompatibility antigen, domains alpha 1 and 2 19374 Recombination activating protein 2. V-D-J recombination is the combinatorial process by which the huge range of immunoglobulin and T cell binding specificity is generated from a limited amount of genetic material. This process is synergistically activate 19378 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 19385 RanBP1 domain 19385 RanBP1 domain 19385 RanBP1 domain 19386 RanBP1 domain 19386 Zn-finger in Ran binding protein and others 19386 Cyclophilin type peptidyl-prolyl cis-trans isomerase 19395 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 19395 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ 19401 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 19401 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 19411 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 19411 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 19414 C2 domain 19414 PH domain. PH stands for pleckstrin homology 19414 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 19414 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 19415 C2 domain 19417 Guanine nucleotide exchange factor for Ras-like GTPases 19417 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 19418 PH domain. PH stands for pleckstrin homology 19418 Guanine nucleotide exchange factor for Ras-like GTPases 19418 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 19419 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 19434 OAR domain 19434 Homeobox domain 19645 Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted 19645 Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve 19649 Fibronectin type III domain 19650 Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted 19650 Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserved 19651 Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted 19651 Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve 19653 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol 19659 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 19660 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 19661 Interphotoreceptor retinoid-binding protein. Interphotoreceptor retinoid-binding protein (IRBP) mediates retinoid trafficking between the photoreceptors and pigmented epithelium. This Pfam family represents a repeat domain found four times in the mammali 19662 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 19663 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 19664 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 19668 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 19671 CAAX amino terminal protease family. Members of this family are probably proteases. the family contains CAAX prenyl protease. The proteins contain a highly conserved Glu-Glu motif at the amino end of the alignment. The alignment also contains two histidi 19679 Phosphatidylinositol transfer protein 19679 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester 19682 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 19683 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 19684 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 19684 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 19687 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 19694 Lectin C-type domain. This family includes both long and short form C-type 19696 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 19697 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 19698 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha 19699 Reeler domain 19701 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 19702 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 19708 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 19712 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 19713 Cadherin domain 19714 DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold 19714 DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities 19718 Rad17 cell cycle checkpoint protein 19718 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 19719 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 19720 B-box zinc finger 19720 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 19724 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 19725 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 19726 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 19726 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 19729 MtN3/saliva family. This family includes proteins such as drosophila saliva, MtN3 involved in root nodule development and a protein involved in activation and expression of recombination activation genes (RAGs). Although the molecular function of these p 19730 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 19731 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 19732 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 19733 Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis 19734 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 19735 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 19736 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 19743 Ammonium Transporter Family 19746 Ammonium Transporter Family 19763 Zinc finger, C3HC4 type (RING finger) 19763 Zinc finger, C3HC4 type (RING finger) 19765 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 19766 Death domain 19771 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 19771 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 19773 Insulin/IGF/Relaxin family. Superfamily includes insulins 19775 EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be 19775 SPX domain. We have named this region the SPX domain after (SYG1, Pho81 and XPR1). This 180 residue length domain is found at the amino terminus of a variety of proteins. In the yeast protein SYG1, the N-terminus directly binds to the G- protein beta sub 19777 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 19876 Fibronectin type III domain 19876 Fibronectin type III domain 19876 Alphaherpesvirus glycoprotein E. Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI) (pfam01688), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation 19877 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 19879 Sugar (and other) transporter 19881 Tetraspanin family 19882 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 19882 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 19882 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 19883 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 19883 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 19885 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 19885 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 19886 Protein kinase domain 19886 Fibronectin type III domain 19888 Doublecortin 19892 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc 19894 C2 domain 19894 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 19896 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 19899 Eukaryotic ribosomal protein L18 19899 Eukaryotic ribosomal protein L18 19921 Ribosomal protein L19e 19933 Ribosomal protein L21e 19934 Ribosomal L22e protein family 19935 Ribosomal protein L23 19942 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 19942 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 19943 Ribosomal L28e protein family 19944 Ribosomal L29e protein family 19946 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 19951 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 19981 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 19982 Ribosomal protein L44 19983 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 19988 Ribosomal protein L6e 19988 Ribosomal protein L6, N-terminal domain 19989 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 20014 ENV polyprotein (coat polyprotein) 20017 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com 20018 RNA polymerases L / 13 to 16 kDa subunit 20019 RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu 20019 RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family 20020 RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family 20020 RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu 20022 RNA polymerases L / 13 to 16 kDa subunit 20024 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p1 20028 Phosducin 20042 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 20055 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 20055 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 20055 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 20068 Ribosomal S17 20084 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 20085 Ribosomal protein S19e 20090 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 20091 Ribosomal S3Ae family 20103 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 20104 Ribosomal protein S6e 20115 Ribosomal protein S7e 20116 Ribosomal protein S8e 20128 Zinc finger, C3HC4 type (RING finger) 20129 High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, gluteni 20132 7 transmembrane receptor (rhodopsin family) 20133 Ribonucleotide reductase, barrel domain 20135 Ribonucleotide reductase, small chain 20147 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 20167 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 20168 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 20181 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 20181 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 20182 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 20182 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 20182 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 20183 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 20183 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 20185 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 20186 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 20186 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 20187 Protein kinase domain 20187 WIF domain. The WIF domain is found in the RYK tyrosine kinase receptors and WIF the Wnt-inhibitory- factor. The domain is extracellular and and contains two conserved cysteines that may form a disulphide bridge. This domain is Wnt binding in WIF, and it 20191 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 20191 RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown 20191 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 20191 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 20193 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20194 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20195 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20196 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20197 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20198 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20199 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20200 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20200 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20201 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20202 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20203 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20203 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20203 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 20204 Homeobox domain 20208 Serum amyloid A protein 20210 Serum amyloid A protein 20211 Serum amyloid A protein 20211 Serum amyloid A protein 20215 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 20215 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 20219 Pentaxin family. Pentaxins are also known as pentraxins 20226 Seryl-tRNA synthetase N-terminal domain. This domain is found associated with the Pfam tRNA synthetase class II domain (pfam00587) and represents the N-terminal domain of seryl-tRNA synthetase 20226 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 20227 SART-1 family. This family of proteins appear to contain a leucine zipper and may therefore be a family of transcription factors 20229 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 20230 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 20231 Homeobox domain 20234 Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein 20239 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 20248 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20254 Granin (chromogranin or secretogranin) 20257 Stathmin family 20262 Stathmin family 20264 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 20264 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 20265 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 20269 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 20271 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 20272 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 20272 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 20273 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 20276 Amiloride-sensitive sodium channel 20277 Amiloride-sensitive sodium channel 20278 Amiloride-sensitive sodium channel 20280 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 20280 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 20280 Beta-ketoacyl synthase, C-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains 20280 Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains. 20286 TPR Domain 20287 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 20288 Macrophage scavenger receptor 20288 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 20290 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20296 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20302 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20303 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20304 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20304 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20305 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20306 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20309 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20310 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20311 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20315 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20315 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 20321 Reeler domain 20334 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2 20334 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 20336 Sec8 exocyst complex component specific domain 20338 Fibronectin type II domain 20339 Lectin C-type domain. This family includes both long and short form C-type 20340 Cysteine rich repeat. This cysteine rich repeat contains four cysteines. It is found in multiple copies in a protein that binds to fibroblast growth factors. The repeat is also found in MG160 and E-selectin ligand (ESL-1) 20343 Lectin C-type domain. This family includes both long and short form C-type 20344 Sushi domain (SCR repeat) 20344 Lectin C-type domain. This family includes both long and short form C-type 20346 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20346 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20347 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20347 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20348 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20349 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20350 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20351 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20352 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20352 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans prot 20352 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 20353 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20353 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 20354 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20355 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20355 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20356 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20356 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 20356 Thrombospondin type 1 domain 20356 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20356 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans prot 20356 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 20357 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 20357 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 20362 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 20362 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 20370 Sushi domain (SCR repeat) 20375 Ets-domain 20378 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 20379 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 20387 Lectin C-type domain. This family includes both long and short form C-type 20388 Surfactant protein B 20388 Saposin A-type domain 20388 Saposin A-type domain 20388 Surfactant protein B 20390 Lectin C-type domain. This family includes both long and short form C-type 20401 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 20411 Sorbin homologous domain 20415 Laminin G domain 20416 SH2 domain 20416 Phosphotyrosine interaction domain (PTB/PID) 20418 Phosphotyrosine interaction domain (PTB/PID) 20423 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 20423 Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation 20425 Serine hydroxymethyltransferase 20429 Homeobox domain 20437 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 20438 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 20439 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 20439 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 20440 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20441 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20442 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20443 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20444 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20444 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20445 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20446 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20447 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20448 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20449 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20450 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20451 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20452 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20454 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 20463 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 20465 Helix-loop-helix DNA-binding domain 20465 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 20465 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 20469 Rap/ran-GAP 20469 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 20471 Homeobox domain 20472 Homeobox domain 20473 Homeobox domain 20474 Homeobox domain 20479 MIT domain 20481 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 20482 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 20482 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 20482 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 20493 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 20494 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 20498 K-Cl Co-transporter type 1 (KCC1) 20500 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 20505 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 20509 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog 20510 Sodium:dicarboxylate symporter family 20511 Sodium:dicarboxylate symporter family 20511 Sodium:dicarboxylate symporter family 20512 Sodium:dicarboxylate symporter family 20513 Sodium:dicarboxylate symporter family 20514 Sodium:dicarboxylate symporter family 20515 Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor 20516 Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor 20516 Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor 20517 Sugar (and other) transporter 20518 Sugar (and other) transporter 20519 Sugar (and other) transporter 20519 Sugar (and other) transporter 20520 Sugar (and other) transporter 20521 Sugar (and other) transporter 20522 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 20523 Mitochondrial carrier protein 20524 Mitochondrial carrier protein 20524 Mitochondrial carrier protein 20525 Sugar (and other) transporter 20526 Sugar (and other) transporter 20527 Sugar (and other) transporter 20528 Sugar (and other) transporter 20529 Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi 20530 Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi 20531 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 20532 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str 20533 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 20535 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 20536 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 20537 Sodium:solute symporter family 20538 Sodium:neurotransmitter symporter family 20541 Calx-beta domain 20541 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 20544 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 20563 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 20563 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 20563 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 20563 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 20563 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 20563 Leucine Rich Repeat. CAUTION: This Pfam may not find all Leucine Rich Repeats in a protein. Leucine Rich Repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in 20563 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 20564 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 20564 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 20564 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 20567 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 20567 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 20567 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 20567 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 20568 WAP-type (Whey Acidic Protein) 'four-disulfide core' 20585 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 20586 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 20586 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 20586 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 20586 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 20591 jmjC domain 20591 jmjN domain 20591 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 20591 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 20591 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 20592 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 20596 Synaptophysin / synaptoporin 20596 7 transmembrane receptor (Secretin family) 20596 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 20596 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 20597 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 20598 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 20602 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 20603 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 20604 Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som 20605 7 transmembrane receptor (rhodopsin family) 20606 7 transmembrane receptor (rhodopsin family) 20607 7 transmembrane receptor (rhodopsin family) 20608 7 transmembrane receptor (rhodopsin family) 20611 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members 20611 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members 20614 SNAP-25 family. SNAP-25 (synaptosome-associated protein 25 kDa) proteins are components of SNARE complexes. Members of this family contain a cluster of cysteine residues that can be palmitoylated for membrane attachment 20616 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 20617 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 20618 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 20620 POLO box duplicated region 20623 Protein kinase domain 20638 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 20639 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 20641 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 20643 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 20648 PH domain. PH stands for pleckstrin homology 20649 PH domain. PH stands for pleckstrin homology 20649 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 20650 PH domain. PH stands for pleckstrin homology 20655 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 20657 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 20660 Fibronectin type III domain 20663 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 20664 HMG (high mobility group) box 20665 HMG (high mobility group) box 20666 HMG (high mobility group) box 20669 HMG (high mobility group) box 20670 HMG (high mobility group) box 20671 HMG (high mobility group) box 20672 HMG (high mobility group) box 20672 HMG (high mobility group) box 20674 HMG (high mobility group) box 20675 HMG (high mobility group) box 20677 HMG (high mobility group) box 20677 HMG (high mobility group) box 20678 HMG (high mobility group) box 20679 HMG (high mobility group) box 20680 HMG (high mobility group) box 20681 HMG (high mobility group) box 20681 HMG (high mobility group) box 20682 HMG (high mobility group) box 20684 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 20684 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 20687 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 20687 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 20688 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 20689 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 20689 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 20690 Hyaluronidase 20692 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 20700 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20701 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20702 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20703 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20704 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20715 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20716 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20717 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20720 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20724 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 20729 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members 20729 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members 20730 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 20733 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 20739 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 20739 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 20740 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 20740 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 20740 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 20740 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 20741 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 20742 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 20742 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 20743 PH domain. PH stands for pleckstrin homology 20743 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 20744 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 20745 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 20747 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 20747 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 20750 Osteopontin 20751 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 20766 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 20778 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 20779 SH2 domain 20779 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 20787 Helix-loop-helix DNA-binding domain 20788 Helix-loop-helix DNA-binding domain 20807 SRF-type transcription factor (DNA-binding and dimerisation domain) 20810 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 20811 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 20813 Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding 20818 ADP-ribosylation factor family 20821 B-box zinc finger 20821 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 20826 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family 20826 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 20833 HMG (high mobility group) box 20833 Structure-specific recognition protein 20840 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 20842 Stromal antigen (SA/STAG) protein 20843 Stromal antigen (SA/STAG) protein 20844 VHS domain. Domain present in VPS-27, Hrs and STAM 20846 SH2 domain 20846 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 20846 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 20846 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 20847 SH2 domain 20847 Minor capsid protein VI. This minor capsid protein may act as a link between the external capsid and the internal DNA-protein core. The C-terminal 11 residues may function as a protease cofactor leading to enzyme activation 20847 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 20847 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 20847 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 20848 SH2 domain 20848 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 20848 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 20848 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 20849 SH2 domain 20849 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 20849 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 20849 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 20849 SH2 domain 20849 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fac 20849 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. Th 20849 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT 20850 SH2 domain 20850 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 20850 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 20850 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 20851 SH2 domain 20851 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 20851 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 20852 SH2 domain 20852 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 20852 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 20852 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 20853 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 20855 Stanniocalcin family 20856 Stanniocalcin family 20860 Sulfotransferase protein 20862 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 20862 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 20863 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 20865 Sulfotransferase protein 20865 Sulfotransferase protein 20867 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 20872 Protein kinase domain 20877 Protein kinase domain 20878 Protein kinase domain 20887 Sulfotransferase protein 20888 Sulfotransferase protein 20893 Helix-loop-helix DNA-binding domain 20905 Sulfatase 20907 Syntaxin 20910 Sec1 family 20911 Sec1 family 20912 Sec1 family 20913 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 20916 ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity 20916 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 20918 Translation initiation factor SUI1 20924 Supt5 repeat. This short region of similarity is found in two tandem copies in Supt5 proteins that are involved in chromatin regulation. The function of this region is unknown 20927 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 20927 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 20930 SURF1 family 20931 Sodium transport protein 20932 SURF4 family 20937 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 20947 PH domain. PH stands for pleckstrin homology 20955 Synaptobrevin 20955 Synaptobrevin 20957 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 20957 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 20963 SH2 domain 20964 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 20965 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 20969 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 20970 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 20972 Synaptogyrin. This family of proteins is distantly related to pfam01284 20973 Synaptogyrin. This family of proteins is distantly related to pfam01284 20974 Synaptogyrin. This family of proteins is distantly related to pfam01284 20975 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 20977 Synaptophysin / synaptoporin 20979 C2 domain 20981 C2 domain 20997 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 21334 Neurokinin B 21336 7 transmembrane receptor (rhodopsin family) 21337 7 transmembrane receptor (rhodopsin family) 21338 7 transmembrane receptor (rhodopsin family) 21346 Calponin family repeat 21346 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 21349 Helix-loop-helix DNA-binding domain 21351 Transaldolase 21357 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 21366 Sodium:neurotransmitter symporter family 21367 Fibronectin type III domain 21371 Tubulin binding cofactor A 21372 WD domain, G-beta repeat 21386 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 21388 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 21390 7 transmembrane receptor (rhodopsin family) 21391 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 21401 Transcription factor S-II (TFIIS) 21406 Helix-loop-helix DNA-binding domain 21414 HMG (high mobility group) box 21415 HMG (high mobility group) box 21416 HMG (high mobility group) box 21417 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 21418 Transcription factor AP-2 21419 Transcription factor AP-2 21420 Transcription factor AP-2 21423 Helix-loop-helix DNA-binding domain 21425 Helix-loop-helix DNA-binding domain 21426 Helix-loop-helix DNA-binding domain 21428 Helix-loop-helix DNA-binding domain 21429 HMG (high mobility group) box 21429 HMG (high mobility group) box 21432 TCL1/MTCP1 family. Two related oncogenes, TCL-1 and MTCP-1, are overexpressed in T cell prolymphocytic leukemias as a result of chromosomal rearrangements that involve the translocation of one T cell receptor gene to either chromosome 14q32 or Xq28 21452 Eukaryotic cobalamin-binding protein 21453 Treacher Collins syndrome protein Treacle 21454 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 21461 Myosin tail 21462 Myosin tail 21645 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 21647 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 21648 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 21652 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 21665 Uracil DNA glycosylase superfamily 21665 Uracil DNA glycosylase superfamily 21673 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 21674 HMG (high mobility group) box 21676 TEA/ATTS domain family 21677 TEA/ATTS domain family 21678 TEA/ATTS domain family 21679 TEA/ATTS domain family 21682 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 21683 Zona pellucida-like domain 21683 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 21687 Fibronectin type III domain 21689 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 21743 NNMT/PNMT/TEMT family 21744 Adenosine-deaminase (editase) domain 21744 Adenosine-deaminase (editase) domain 21744 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 21753 LIM domain. This family represents two copies of the LIM structural domain 21766 Protein of unknown function (DUF396). A family of conserved eukaryotic transmembrane proteins 21769 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i 21770 Protein phosphatase 2A regulatory B subunit (B56 family) 21780 HMG (high mobility group) box 21781 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 21784 Trefoil (P-type) domain 21785 Trefoil (P-type) domain 21785 Trefoil (P-type) domain 21786 Trefoil (P-type) domain 21788 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 21803 Transforming growth factor beta like domain 21803 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 21804 LIM domain. This family represents two copies of the LIM structural domain 21807 TSC-22/dip/bun family 21808 Transforming growth factor beta like domain 21808 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 21809 Transforming growth factor beta like domain 21809 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 21810 Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria 21812 Protein kinase domain 21812 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 21813 Protein kinase domain 21813 Protein kinase domain 21814 Zona pellucida-like domain 21816 Transglutaminase family 21816 Transglutaminase family, C-terminal ig like domain 21816 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 21817 Transglutaminase family 21817 Transglutaminase family, C-terminal ig like domain 21817 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 21818 Transglutaminase family 21818 Transglutaminase family, C-terminal ig like domain 21818 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 21819 Carboxylesterase 21819 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 21823 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 21825 Thrombospondin N-terminal -like domain 21825 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 21826 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 21828 Thrombospondin N-terminal -like domain 21832 Erythropoietin/thrombopoietin 21833 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 21833 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 21833 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 21833 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 21834 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 21834 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 21844 Raf-like Ras-binding domain 21846 Fibronectin type III domain 21847 Zinc finger, C2H2 type 21848 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 21848 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 21849 B-box zinc finger 21849 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 21854 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim 21854 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1 21855 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim 21855 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1 21856 Mitochondrial import inner membrane, translocase subunit TIM44. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids o 21856 Mitochondrial import inner membrane, translocase subunit TIM44. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids of 21857 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 21858 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 21869 Homeobox domain 21871 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 21872 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 21873 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 21877 Thymidine kinase 21884 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 21884 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 21885 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 21886 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 21887 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 21892 CUB domain 21892 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 21894 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 21897 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 21898 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 21899 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 21906 Protein of unknown function, DUF270 21906 Protein of unknown function, DUF270 21907 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 21907 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 21908 Homeobox domain 21909 Homeobox domain 21912 Tetraspanin family 21915 Thymidylate kinase 21916 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 21917 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 21923 Fibronectin type III domain 21927 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 21929 A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation 21930 CUB domain 21930 Extracellular link domain 21933 Death domain 21933 TNFR/NGFR cysteine-rich region 21935 Sugar (and other) transporter 21937 TNFR/NGFR cysteine-rich region 21937 TNFR/NGFR cysteine-rich region 21938 TNFR/NGFR cysteine-rich region 21942 TNFR/NGFR cysteine-rich region 21947 TNF(Tumor Necrosis Factor) family 21948 TNF(Tumor Necrosis Factor) family 21949 TNF(Tumor Necrosis Factor) family 21950 TNF(Tumor Necrosis Factor) family 21951 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 21952 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 21953 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 21954 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 21955 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropomy 21959 TPR Domain 21959 Nuclear transition protein 2 21959 FKBP-type peptidyl-prolyl cis-trans isomerases 21960 Fibronectin type III domain 21960 Fibronectin type III domain 21960 Fibrinogen beta and gamma chains, C-terminal globular domain 21968 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 21975 Topoisomerase DNA binding C4 zinc finger 21981 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 21981 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 21981 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 21981 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 21985 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction 21987 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction 21990 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 21991 Triosephosphate isomerase 22003 Tropomyosin 22004 Tropomyosin 22018 Animal haem peroxidase 22018 Sushi domain (SCR repeat) 22019 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 22021 Sulfotransferase protein 22022 Sulfotransferase protein 22025 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 22025 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 22026 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 22029 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 22030 TRAF-type zinc finger 22030 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 22030 TRAF-type zinc finger 22030 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 22031 TRAF-type zinc finger 22031 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 22032 TRAF-type zinc finger 22032 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 22033 TRAF-type zinc finger 22033 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 22034 TRAF-type zinc finger 22034 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 22038 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 22041 Transferrin 22042 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 22042 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 22045 7 transmembrane receptor (rhodopsin family) 22051 LIM domain. This family represents two copies of the LIM structural domain 22057 BTG1 family. A novel family of anti-proliferative proteins 22059 P53 22061 P53 22062 P53 22062 P53 22063 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 22064 XRCC1 N terminal domain 22064 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 22065 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 22066 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 22067 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 22068 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 22070 Translationally controlled tumor protein 22072 Trypsin 22072 Trypsin 22073 Trypsin 22074 Trypsin 22074 Trypsin 22084 Tuberin 22084 Rap/ran-GAP 22095 7 transmembrane receptor (rhodopsin family) 22099 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 22110 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 22114 Protein kinase domain 22114 Protein kinase domain 22121 Ribosomal protein L13 22130 Myb-like DNA-binding domain 22138 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 22139 Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders 22141 Tub family 22151 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 22151 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 22157 Tub family 22157 Tub family 22158 Tub family 22158 Tub family 22160 Helix-loop-helix DNA-binding domain 22163 TNFR/NGFR cysteine-rich region 22164 TNF(Tumor Necrosis Factor) family 22165 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 22166 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 22171 Thymidylate synthase 22173 Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372 22174 Fibronectin type III domain 22178 Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372 22183 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 22184 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 22186 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 22186 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 22186 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 22186 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 22186 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 22186 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 22187 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 22190 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 22192 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 22195 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 22200 Repeat in ubiquitin-activating (UBA) protein 22200 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 22201 Repeat in ubiquitin-activating (UBA) protein 22201 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 22202 Repeat in ubiquitin-activating (UBA) protein 22202 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 22217 Ubiquitin carboxyl-terminal hydrolase family 2 22222 Putative zinc finger in N-recognin 22223 Ubiquitin carboxyl-terminal hydrolase, family 1 22227 Mitochondrial carrier protein 22228 Mitochondrial carrier protein 22229 Mitochondrial carrier protein 22230 Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognized for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family inclu 22232 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 22236 UDP-glucoronosyl and UDP-glucosyl transferase 22238 UDP-glucoronosyl and UDP-glucosyl transferase 22239 UDP-glucoronosyl and UDP-glucosyl transferase 22240 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 22242 Zona pellucida-like domain 22245 Phosphoribulokinase / Uridine kinase family 22247 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 22247 Orotidine 5'-phosphate decarboxylase / HUMPS family. This family includes Orotidine 5'-phosphate decarboxylase enzymes EC:4.1.1.23 that are involved in the final step of pyrimidine biosynthesis. The family also includes enzymes such as hexulose-6-phosphat 22249 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 22255 Homeobox domain 22259 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 22259 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 22260 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 22260 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 22268 Tetraspanin family 22268 Tetraspanin family 22272 pfam02939, UcrQ, UcrQ family. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This family represents the 9.5 kDa subunit of the complex 22275 Uroporphyrinogen decarboxylase (URO-D) 22278 Helix-loop-helix DNA-binding domain 22282 Helix-loop-helix DNA-binding domain 22283 Fibronectin type III domain 22283 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 22287 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 22288 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 22288 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 22288 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 22288 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 22289 TPR Domain 22289 jmjC domain 22290 jmjC domain 22294 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 22295 Cadherin domain 22296 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 22296 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 22296 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 22297 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 22297 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 22301 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22302 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22303 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22304 7 transmembrane receptor (metabotropic glutamate family) 22304 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22305 7 transmembrane receptor (metabotropic glutamate family) 22305 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22308 7 transmembrane receptor (metabotropic glutamate family) 22308 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22309 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22310 7 transmembrane receptor (metabotropic glutamate family) 22310 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22311 7 transmembrane receptor (metabotropic glutamate family) 22312 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22315 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 22317 Synaptobrevin 22318 Synaptobrevin 22319 Synaptobrevin 22320 Synaptobrevin 22321 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 22321 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 22321 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 22321 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 22323 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 22324 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 22324 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 22325 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 22325 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 22326 Homeobox domain 22327 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 22327 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids, some interacts with c-Myc and repress its transcriptional activity. Suggesting that other members of this family may also regulate transcription by intera 22327 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b 22330 Vinculin family 22333 Eukaryotic porin 22334 Eukaryotic porin 22335 Eukaryotic porin 22337 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 22337 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 22339 Platelet-derived growth factor (PDGF) 22341 Platelet-derived growth factor (PDGF) 22342 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 22343 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 22346 von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA 22348 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 22350 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 22350 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 22351 Villin headpiece domain 22352 Intermediate filament protein 22353 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 22353 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 22354 7 transmembrane receptor (Secretin family) 22355 7 transmembrane receptor (Secretin family) 22359 Low-density lipoprotein receptor domain class A 22359 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 22361 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 22364 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 22364 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 22365 Sec1 family 22367 Protein kinase domain 22370 Somatomedin B domain 22370 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 22371 von Willebrand factor type A domain 22371 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 22371 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 22371 von Willebrand factor type D domain 22371 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 22371 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 22371 von Willebrand factor type D domain 22371 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 22375 WHEP-TRS domain 22376 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 22376 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 22378 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 22385 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 22390 Protein kinase domain 22402 Thrombospondin type 1 domain 22408 wnt family 22408 wnt family 22409 wnt family 22410 wnt family 22412 wnt family 22413 wnt family 22414 wnt family 22415 wnt family 22416 wnt family 22417 wnt family 22418 wnt family 22419 wnt family 22420 wnt family 22422 wnt family 22427 HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease 22427 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi 22427 HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease 22427 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 22427 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 22427 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin 22431 Wilm's tumour protein 22431 Wilm's tumour protein 22436 [2Fe-2S] binding domain 22436 CO dehydrogenase flavoprotein C-terminal domain 22436 FAD binding domain in molybdopterin dehydrogenase 22436 Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain 22436 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 22437 7 transmembrane receptor (rhodopsin family) 22441 Occludin/ELL family 22589 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 22589 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 22590 XPA protein 22591 DNA repair protein Rad4 22592 XPG I-region 22594 XRCC1 N terminal domain 22594 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 22596 Ku70/Ku80 C-terminal arm. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the C terminal arm. Thi 22596 Ku70/Ku80 N-terminal alpha/beta domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the amino 22596 Ku70/Ku80 beta-barrel domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the central DNA-bin 22598 Sodium:neurotransmitter symporter family 22599 Sodium:neurotransmitter symporter family 22612 SH2 domain 22612 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 22612 SH2 domain 22612 Protein kinase domain 22612 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 22626 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict 22627 14-3-3 protein 22628 14-3-3 protein 22629 14-3-3 protein 22630 14-3-3 protein 22631 14-3-3 protein 22634 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 22635 von Willebrand factor type D domain 22635 MAM domain. An extracellular domain found in many receptors 22635 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr 22635 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr 22635 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 22637 SH2 domain 22640 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22640 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 22642 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 22643 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22654 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22658 Zinc finger, C3HC4 type (RING finger) 22661 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 22666 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 22670 B-box zinc finger 22673 LIM domain. This family represents two copies of the LIM structural domain 22682 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i 22687 ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with othe 22688 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22689 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22691 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 22695 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 22698 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22701 Transposase. Transposase proteins are necessary for efficient DNA transposition. Contains transposases for IS204, IS1001, IS1096 and IS1165 22709 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22712 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22715 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22715 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22755 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22759 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22775 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 22784 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 22786 Zona pellucida-like domain 22787 Zona pellucida-like domain 22788 Zona pellucida-like domain 22789 Sushi domain (SCR repeat) 22793 LIM domain. This family represents two copies of the LIM structural domain 22795 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 22797 Helix-loop-helix DNA-binding domain 22798 Uncharacterized ACR, COG1579 22798 Intermediate filament protein 22798 Laminin N-terminal (Domain VI) 22798 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 22798 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 22798 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 22798 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 22801 von Willebrand factor type A domain 22818 Clathrin adaptor complex small chain 22821 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 22823 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 22824 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 22829 Carboxylesterase 22836 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 22836 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 22839 Guanylate-kinase-associated protein (GKAP) protein 22841 C2 domain 22846 5'-nucleotidase 22856 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 22856 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 22859 Galactose binding lectin domain 22859 7 transmembrane receptor (Secretin family) 22859 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 22859 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou 22861 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 22861 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 22861 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 22861 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 22861 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 22862 Fibronectin type III domain 22864 R3H domain 22864 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 22865 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 22870 FHIPEP family 22871 Carboxylesterase 22875 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 22879 SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs 22885 Villin headpiece domain 22885 LIM domain. This family represents two copies of the LIM structural domain 22890 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 22893 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 22895 C2 domain 22895 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 22898 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 22898 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 22900 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 22901 Sulfatase 22902 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 22903 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 22903 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 22904 Helicase conserved C-terminal domain 22908 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 22914 Lectin C-type domain. This family includes both long and short form C-type 22915 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 22925 Lectin C-type domain. This family includes both long and short form C-type 22932 Zona pellucida-like domain 22932 Zona pellucida-like domain 22933 Sir2 family 22933 Sir2 family 22936 Occludin/ELL family 22937 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 22947 Homeobox domain 22948 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 22950 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 22953 ATP P2X receptor 22953 ATP P2X receptor 22953 ATP P2X receptor 22953 ATP P2X receptor 22953 ATP P2X receptor 22953 ATP P2X receptor 22954 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alp 22955 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 22976 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 22976 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 22982 AMP-binding enzyme 22983 Protein kinase domain 22983 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 22985 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 22986 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 22987 Sugar (and other) transporter 22989 Myosin head (motor domain) 22989 Intermediate filament protein 22989 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 22989 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 22989 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 22989 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 22992 jmjC domain 22992 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 22993 HMG (high mobility group) box 22994 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 22994 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 22997 Fibronectin type III domain 22997 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 22998 LIM domain. This family represents two copies of the LIM structural domain 22999 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 23001 Beige/BEACH domain 23005 WD domain, G-beta repeat 23007 C2 domain 23007 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 23007 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 23008 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 23011 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 23017 Uncharacterized protein family UPF0005 23019 CCR4-Not complex component, Not1. The Ccr4-Not complex is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID 23024 TRAF-type zinc finger 23024 Zinc finger, C3HC4 type (RING finger) 23024 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 23025 C2 domain 23028 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 23028 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 23029 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 23030 jmjN domain 23030 jmjC domain 23031 Protein kinase domain 23031 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 23035 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 23040 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 23043 Protein kinase domain 23043 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 23048 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 23049 Phosphatidylinositol 3- and 4-kinase 23049 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 23049 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 23049 Phosphatidylinositol 3- and 4-kinase 23049 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 23049 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 23058 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 23062 VHS domain. Domain present in VPS-27, Hrs and STAM 23062 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 23062 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 23062 VHS domain. Domain present in VPS-27, Hrs and STAM 23063 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 23067 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 23070 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 23070 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl 23071 Calsequestrin 23071 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 23075 PH domain. PH stands for pleckstrin homology 23081 jmjC domain 23081 jmjN domain 23087 B-box zinc finger 23087 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 23087 B-box zinc finger 23089 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 23091 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 23094 Rap/ran-GAP 23094 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 23095 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 23096 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 23097 Protein kinase domain 23099 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 23105 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 23105 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 23108 Rap/ran-GAP 23111 MIT domain 23113 Cullin family 23113 Anaphase-promoting complex, subunit 10 (APC10) 23113 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 23114 Fibronectin type III domain 23114 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 23118 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 23118 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 23119 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 23119 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 23119 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 23119 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 23120 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 23125 CG-1 domain. CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein. CG-1 domains are ass 23125 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 23126 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 23127 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va 23129 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 23133 jmjC domain 23133 jmjC domain 23133 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 23135 jmjC domain 23136 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 23142 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 23142 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 23143 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 23145 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 23148 NAC domain 23150 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 23150 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 23151 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 23157 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 23157 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 23157 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 23157 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 23158 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 23158 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 23160 WD domain, G-beta repeat 23162 Myosin tail 23162 Uncharacterized ACR 23162 Ezrin/radixin/moesin family 23162 Intermediate filament protein 23163 VHS domain. Domain present in VPS-27, Hrs and STAM 23163 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 23163 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 23163 VHS domain. Domain present in VPS-27, Hrs and STAM 23163 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 23163 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 23168 Plus-3 domain. This domain is about 90 residues in length and is often found associated with the pfam02213 domain. The function of this domain is uncertain. It is possible that this domain is involved in DNA binding as it has three conserved positively c 23170 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 23171 NAD-dependent glycerol-3-phosphate dehydrogenase 23176 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 23179 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 23181 AMP-binding enzyme 23186 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 23186 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 23187 Uncharacterized ACR, COG1579 23187 PH domain. PH stands for pleckstrin homology 23187 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 23192 Peptidase family C54 23192 Peptidase family C54 23193 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 23194 F-box domain 23197 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 23200 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 23200 E1-E2 ATPase 23207 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 23210 jmjC domain 23213 Sulfatase 23218 Beige/BEACH domain 23218 WD domain, G-beta repeat 23219 F-box domain 23220 Zinc finger, C3HC4 type (RING finger) 23220 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 23221 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 23224 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 23224 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 23225 Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins 23228 PH domain. PH stands for pleckstrin homology 23232 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 23233 Exocyst complex subunit Sec15-like 23234 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 23235 Protein kinase domain 23236 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 23236 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 23236 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 23236 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 23237 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 23239 PH domain. PH stands for pleckstrin homology 23239 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 23243 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 23249 Thrombospondin type 1 domain 23250 E1-E2 ATPase 23254 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 23256 Sec1 family 23256 Sec1 family 23258 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 23258 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 23258 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 23258 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 23258 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 23259 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 23259 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera 23261 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 23261 CG-1 domain. CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein. CG-1 domains are asso 23262 Histidine acid phosphatase 23265 Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in S. cerevisiae , Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In S. cerevisiae, the exocys 23266 Galactose binding lectin domain 23266 7 transmembrane receptor (Secretin family) 23266 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 23266 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou 23268 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 23268 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 23268 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 23269 Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome 23270 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 23271 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 23273 CDC45-like protein. CDC45 is an essential gene required for initiation of DNA replication in S. cerevisiae, forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human, mouse and smut fungus among others 23276 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 23283 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 23284 Galactose binding lectin domain 23284 7 transmembrane receptor (Secretin family) 23284 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 23284 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou 23294 Phosphotyrosine interaction domain (PTB/PID) 23294 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 23294 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 23302 Sulfotransferase protein 23302 WSC domain. This domain may be involved in carbohydrate binding 23303 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 23307 Uncharacterized ACR, COG1579 23307 Intermediate filament protein 23307 Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown 23307 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 23307 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 23307 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 23307 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 23309 Paired amphipathic helix repeat. This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-lo 23314 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 23315 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 23316 Homeobox domain 23316 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 23316 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 23317 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 23318 PAP/25A associated domain 23318 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 23322 C2 domain 23322 Microtubule associated protein (MAP65/ASE1 family) 23322 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 23324 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 23326 Ubiquitin carboxyl-terminal hydrolase family 2 23326 Ubiquitin carboxyl-terminal hydrolases family 2 23326 Zn-finger in ubiquitin-hydrolases and other protein 23327 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 23329 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 23336 Intermediate filament protein 23336 Intermediate filament protein 23339 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 23339 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 23341 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 23341 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 23345 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 23347 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 23350 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 23350 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 23353 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length 23358 Ubiquitin carboxyl-terminal hydrolase family 2 23358 Ubiquitin carboxyl-terminal hydrolases family 2 23361 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 23362 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 23363 Fibronectin type III domain 23363 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 23368 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou 23380 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 23380 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 23382 S-adenosyl-L-homocysteine hydrolase 23384 Uncharacterized ACR, COG1579 23384 Intermediate filament protein 23384 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 23384 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 23386 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 23396 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 23399 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 23405 Domain of unknown function. This putative domain is found in members of the Dicer protein family. This protein is a dsRNA nuclease that is involved in RNAi and related processes. This domain of about 100 amino acids has no known function, but does contai 23405 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 23405 Domain of unknown function. This putative domain is found in members of the Dicer protein family. This protein is a dsRNA nuclease that is involved in RNAi and related processes. This domain of about 100 amino acids has no known function, but does contai 23405 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 23408 Sir2 family 23409 Sir2 family 23410 Sir2 family 23411 Sir2 family 23415 Cyclic nucleotide-binding domain 23415 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 23415 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 23416 Cyclic nucleotide-binding domain 23416 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 23416 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 23423 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 23424 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 23426 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 23431 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 23432 7 transmembrane receptor (rhodopsin family) 23432 7 transmembrane receptor (rhodopsin family) 23439 Sodium / potassium ATPase beta chain 23440 Homeobox domain 23443 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 23450 CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p 23458 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 23462 Helix-loop-helix DNA-binding domain 23463 Isoprenylcysteine carboxyl methyltransferase (ICMT) family. The isoprenylcysteine o-methyltransferase (EC:2.1.1.100) family carry out carboyxl methylation of cleaved eukaryotic proteins that terminate in a CaaX motif. In Saccharomyces cerevisiae this met 23463 Isoprenylcysteine carboxyl methyltransferase (ICMT) family. The isoprenylcysteine o-methyltransferase (EC:2.1.1.100) family carry out carboyxl methylation of cleaved eukaryotic proteins that terminate in a CaaX motif. In Saccharomyces cerevisiae this met 23468 'chromo' (CHRromatin Organization MOdifier) domain 23468 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 23471 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 23473 MIT domain 23478 Signal peptidase I 23479 NifU-like N terminal domain. This domain is found in NifU in combination with pfam01106. This domain is found on isolated in several bacterial species. The nif genes are responsible for nitrogen fixation. However this domain is found in bacteria that do 23484 Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei 23492 'chromo' (CHRromatin Organization MOdifier) domain 23493 Helix-loop-helix DNA-binding domain 23499 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 23499 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 23499 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 23503 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 23504 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 23510 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 23515 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 23516 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 23521 Ribosomal protein L13 23522 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 23526 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 23526 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 23527 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 23529 Ciliary neurotrophic factor 23530 NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide 23538 7 transmembrane receptor (rhodopsin family) 23539 Monocarboxylate transporter 23541 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 23541 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 23542 Phosphotyrosine interaction domain (PTB/PID) 23542 Phosphotyrosine interaction domain (PTB/PID) 23542 Phosphotyrosine interaction domain (PTB/PID) 23544 Sushi domain (SCR repeat) 23545 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 23546 Synaptogyrin. This family of proteins is distantly related to pfam01284 23549 Aminopeptidase I zinc metalloprotease (M18) 23550 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 23553 Hyaluronidase 23554 Tetraspanin family 23555 Tetraspanin family 23558 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 23562 PMP-22/EMP/MP20/Claudin family 23562 PMP-22/EMP/MP20/Claudin family 23564 Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzyme 23566 7 transmembrane receptor (rhodopsin family) 23569 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 23576 Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzyme 23581 ICE-like protease (caspase) p10 domain 23583 Uracil DNA glycosylase superfamily 23592 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 23596 7 transmembrane receptor (rhodopsin family) 23597 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 23598 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 23598 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 23598 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 23598 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 23607 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 23613 MYND finger 23613 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 23616 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 23620 7 transmembrane receptor (rhodopsin family) 23621 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 23621 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 23621 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 23621 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 23623 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 23624 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 23624 CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co 23624 CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser 23627 Prion/Doppel alpha-helical domain. The prion protein is thought to be the infectious agent that causes transmissible spongiform encephalopathies, such as scrapie and BSE. It is thought that the prion protein can exist in two different forms: one is the n 23632 Eukaryotic-type carbonic anhydrase 23643 ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r 23649 DNA polymerase alpha subunit B. The B subunit of the DNA polymerase alpha plays an essential role at the initial stage of DNA replication in S. cerevisiae and is phosphorylated in a cell cycle-dependent manner 23658 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 23659 Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol 23660 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 23660 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 23677 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 23678 PX domain. PX domains bind to phosphoinositides 23678 PX domain. PX domains bind to phosphoinositides 23683 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 23705 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 23710 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 23732 DOMON domain. The DOMON (named after dopamine beta-monooxygenase N-terminal) domain is 110-125 residues long. It is predicted to form an all beta fold with 7-8 strands. The domain may mediate extracellular adhesive interactions 23743 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 23760 Phosphatidylinositol transfer protein 23762 Oxysterol-binding protein 23764 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 23764 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 23768 Fibronectin type III domain 23768 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 23768 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 23770 FKBP-type peptidyl-prolyl cis-trans isomerase 23775 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 23775 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 23779 RhoGAP domain 23786 Apoptosis regulator proteins 23787 Mitochondrial carrier protein 23788 Mitochondrial carrier protein 23792 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 23792 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 23792 Disintegrin 23792 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 23792 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 23793 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 23793 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 23794 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 23794 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 23796 7 transmembrane receptor (rhodopsin family) 23801 Lipoxygenase 23801 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 23802 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 23805 AP endonuclease family 1 23806 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 23807 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 23821 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 23825 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 23827 Inositol monophosphatase family 23831 Eukaryotic-type carbonic anhydrase 23832 7 transmembrane receptor (rhodopsin family) 23835 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 23836 Cadherin domain 23836 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 23856 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 23856 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 23857 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 23871 Ets-domain 23871 Sterile alpha motif (SAM)/Pointed domain 23871 Ets-domain 23871 Sterile alpha motif (SAM)/Pointed domain 23872 Ets-domain 23882 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 23885 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 23887 Gamma-glutamyltranspeptidase 23888 Glypican 23890 7 transmembrane receptor (rhodopsin family) 23892 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 23893 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 23894 Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex 23917 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 23918 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 23920 Fibronectin type III domain 23920 Furin-like cysteine rich region 23920 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 23925 Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be 23925 Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be 23934 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 23936 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 23937 Mab-21 protein 23942 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 23943 C2 domain 23947 B-box zinc finger 23948 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 23948 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 23948 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 23957 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 23958 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 23958 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 23962 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 23964 Plant PEC family metallothionein 23970 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 23971 Adenylylsulfate kinase. Enzyme that catalyses the phosphorylation of adenylylsulfate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif 23971 ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for P 23972 Adenylylsulfate kinase. Enzyme that catalyses the phosphorylation of adenylylsulfate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif 23972 ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for P 23980 Phosphatidylethanolamine-binding protein 23984 3'5'-cyclic nucleotide phosphodiesterase 23985 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 23986 Acyl CoA binding protein 23988 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 23988 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 23992 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 24012 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 24012 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 24013 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 24014 Protein kinase domain 24017 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 24018 mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme 24018 mRNA capping enzyme, C-terminal domain 24018 mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme 24030 Ribosomal protein S12 24044 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 24045 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 24046 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 24050 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 24058 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 24059 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 24059 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 24060 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 24067 SRP54-type protein, helical bundle domain 24067 pfam02881, SRP54_N, SRP54-type protein, helical bundle domain 24067 SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins 24070 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How 24084 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 24086 Protein kinase domain 24086 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 24087 CUB domain 24087 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 24088 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 24105 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 24108 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 24109 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 24112 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 24113 Homeobox domain 24115 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 24132 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 24136 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 24139 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes 24140 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl 24140 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl 24140 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl 24141 Lysosome-associated membrane glycoprotein (Lamp) 24146 PMP-22/EMP/MP20/Claudin family 24153 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 24153 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 24162 Histidine acid phosphatase 24165 Adenosine/AMP deaminase 24166 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 24167 7 transmembrane receptor (Secretin family) 24170 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 24171 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 24173 7 transmembrane receptor (rhodopsin family) 24174 7 transmembrane receptor (rhodopsin family) 24175 7 transmembrane receptor (rhodopsin family) 24176 7 transmembrane receptor (rhodopsin family) 24177 Serum albumin family 24179 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 24180 7 transmembrane receptor (rhodopsin family) 24182 7 transmembrane receptor (rhodopsin family) 24183 Adenylate kinase 24184 Adenylate kinase 24185 PH domain. PH stands for pleckstrin homology 24186 Serum albumin family 24188 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 24189 Fructose-bisphosphate aldolase class-I 24190 Fructose-bisphosphate aldolase class-I 24191 Fructose-bisphosphate aldolase class-I 24192 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 24197 Alkaline phosphatase 24203 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe 24203 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st 24208 Androgen receptor 24208 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 24208 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 24210 Hepatic lectin, N-terminal domain 24210 Lectin C-type domain. This family includes both long and short form C-type 24214 Sodium / potassium ATPase beta chain 24216 E1-E2 ATPase 24216 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 24216 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 24216 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 24217 Sodium / potassium ATPase beta chain 24218 Heavy-metal-associated domain 24221 Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5 24224 Apoptosis regulator proteins, Bcl-2 family 24225 Nerve growth factor family 24227 Zinc finger, C3HC4 type (RING finger) 24227 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 24230 TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermedia 24231 Sushi domain (SCR repeat) 24231 von Willebrand factor type A domain 24231 Trypsin 24232 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 24232 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 24232 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 24232 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 24233 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 24233 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 24233 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 24233 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 24233 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 24233 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 24233 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fo 24233 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 24235 Sushi domain (SCR repeat) 24236 Sushi domain (SCR repeat) 24237 Sushi domain (SCR repeat) 24237 Thrombospondin type 1 domain 24237 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 24239 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 24247 7 transmembrane receptor (metabotropic glutamate family) 24247 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24248 Catalase 24249 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 24249 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ 24251 Tetraspanin family 24254 Carboxylesterase 24255 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 24255 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 24258 Granin (chromogranin or secretogranin) 24259 Granin (chromogranin or secretogranin) 24260 7 transmembrane receptor (rhodopsin family) 24261 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 24261 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 24264 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 24264 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 24265 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 24265 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 24268 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 24269 Zinc carboxypeptidase 24269 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 24271 Zinc carboxypeptidase 24271 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 24273 Hsp20/alpha crystallin family 24273 Alpha crystallin A chain, N terminal 24277 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 24278 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 24279 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 24279 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 24282 Somatotropin hormone family 24284 Casein 24293 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24294 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24296 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24297 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24298 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24299 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24303 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24303 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24307 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24310 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 24311 Pyridoxal-dependent decarboxylase conserved domain 24312 Dihydrofolate reductase 24314 Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones 24315 Somatotropin hormone family 24316 7 transmembrane receptor (rhodopsin family) 24318 7 transmembrane receptor (rhodopsin family) 24323 Endothelin family 24324 Endothelin family 24326 7 transmembrane receptor (rhodopsin family) 24329 Furin-like cysteine rich region 24329 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 24331 Trypsin 24334 Enolase, N-terminal domain 24334 Enolase, C-terminal TIM barrel domain 24335 Erythropoietin/thrombopoietin 24337 Furin-like cysteine rich region 24337 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 24346 Carboxylesterase 24356 Ets-domain 24360 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 24361 Fanconi anaemia group C protein 24362 Fructose-1-6-bisphosphatase 24366 Fibrinogen beta and gamma chains, C-terminal globular domain 24366 Fibrinogen beta and gamma chains, C-terminal globular domain 24367 Fibrinogen beta and gamma chains, C-terminal globular domain 24367 ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits 24370 Fork head domain 24375 Alpha-L-fucosidase 24379 Pyridoxal-dependent decarboxylase conserved domain 24380 Pyridoxal-dependent decarboxylase conserved domain 24385 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 24387 Intermediate filament protein 24392 Connexin 24392 Gap junction alpha-1 protein (Cx43) 24397 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 24397 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 24404 Glutathione peroxidase 24406 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24407 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24407 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24408 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24409 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24410 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24411 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24412 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24413 Glucocorticoid receptor 24413 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 24413 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 24414 7 transmembrane receptor (metabotropic glutamate family) 24414 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24415 7 transmembrane receptor (metabotropic glutamate family) 24415 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24416 7 transmembrane receptor (metabotropic glutamate family) 24416 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24417 7 transmembrane receptor (metabotropic glutamate family) 24417 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24418 7 transmembrane receptor (metabotropic glutamate family) 24418 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24419 7 transmembrane receptor (metabotropic glutamate family) 24419 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24419 7 transmembrane receptor (metabotropic glutamate family) 24419 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24420 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 24420 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 24420 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 24421 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 24422 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 24423 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 24424 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 24426 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 24434 Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 24434 Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold 24437 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 24438 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 24439 Metallo-beta-lactamase superfamily 24440 Globin 24443 Pyridoxal-dependent decarboxylase conserved domain 24443 Pyridoxal-dependent decarboxylase conserved domain 24446 Trypsin 24446 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 24448 7 transmembrane receptor (rhodopsin family) 24450 Hydroxymethylglutaryl-coenzyme A synthase 24450 Hydroxymethylglutaryl-coenzyme A synthase 24451 Heme oxygenase 24452 Homeobox domain 24456 Homeobox domain 24457 Homeobox domain 24459 Homeobox domain 24464 Sushi domain (SCR repeat) 24465 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 24468 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 24470 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 24471 Hsp20/alpha crystallin family 24472 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 24473 7 transmembrane receptor (rhodopsin family) 24481 Interferon alpha/beta domain 24482 Insulin/IGF/Relaxin family. Superfamily includes insulins 24484 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 24494 Interleukin-1 propeptide. The Interleukin-1 cytokines are translated as precursor proteins. The N terminal approx. 115 amino acids form a propeptide that is cleaved off to release the active interleukin-1 24495 Interleukin-3 24497 Interleukin 5 24503 Intermediate filament protein 24504 Transforming growth factor beta like domain 24505 Insulin/IGF/Relaxin family. Superfamily includes insulins 24506 Insulin/IGF/Relaxin family. Superfamily includes insulins 24508 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp 24511 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 24516 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 24516 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 24517 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 24517 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 24518 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 24518 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 24520 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 24520 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 24520 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 24520 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 24521 Inward rectifier potassium channel 24523 Trypsin 24528 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme 24530 Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol 24538 Lipase 24538 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 24539 Lipase 24539 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 24546 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 24546 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 24547 Myelin basic protein 24548 Lectin C-type domain. This family includes both long and short form C-type 24553 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 24553 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 24553 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 24559 Protein kinase domain 24561 Methylpurine-DNA glycosylase (MPG). Methylpurine-DNA glycosylase is a base excision-repair protein. It is responsible for the hydrolysis of the deoxyribose N-glycosidic bond, excising 3-methyladenine and 3-methylguanine from damaged DNA 24566 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 24566 PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some 24575 Dynamin GTPase effector domain 24575 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 24575 Dynamin family 24575 Dynamin GTPase effector domain 24575 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 24577 Helix-loop-helix DNA-binding domain 24577 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 24581 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 24583 Myosin head (motor domain) 24586 Fibronectin type III domain 24587 Intermediate filament protein 24588 Intermediate filament protein 24596 TNFR/NGFR cysteine-rich region 24598 Flavodoxin 24598 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 24598 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 24599 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 24600 Flavodoxin 24600 FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase 24602 Atrial natriuretic peptide 24603 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 24604 Pancreatic hormone peptide 24609 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 24613 7 transmembrane receptor (rhodopsin family) 24614 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 24615 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 24616 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 24617 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 24618 Lectin C-type domain. This family includes both long and short form C-type 24620 Lectin C-type domain. This family includes both long and short form C-type 24624 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 24626 3'5'-cyclic nucleotide phosphodiesterase 24627 3'5'-cyclic nucleotide phosphodiesterase 24638 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 24638 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 24640 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 24640 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 24644 Phosphoglycerate kinase 24648 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 24649 Protein kinase domain 24651 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 24654 C2 domain 24654 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 24654 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 24655 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 24656 Somatotropin hormone family 24658 Somatotropin hormone family 24660 PMP-22/EMP/MP20/Claudin family 24663 RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family 24664 Corticotropin ACTH domain 24677 Pancreatic hormone peptide 24679 Cyclic nucleotide-binding domain 24681 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 24683 Somatotropin hormone family 24684 Fibronectin type III domain 24686 Prion/Doppel alpha-helical domain. The prion protein is thought to be the infectious agent that causes transmissible spongiform encephalopathies, such as scrapie and BSE. It is thought that the prion protein can exist in two different forms: one is the no 24688 Intermediate filament protein 24693 Animal haem peroxidase 24693 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 24694 Parathyroid hormone family 24695 Parathyroid hormone family 24699 Fibronectin type III domain 24699 Fibronectin type III domain 24699 Protein-tyrosine phosphatase 24701 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 24703 Protein kinase domain 24703 Raf-like Ras-binding domain 24703 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 24705 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 24705 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 24708 Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted 24708 Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserved 24710 Lipocalin / cytosolic fatty-acid binding protein family 24711 Interphotoreceptor retinoid-binding protein. Interphotoreceptor retinoid-binding protein (IRBP) mediates retinoid trafficking between the photoreceptors and pigmented epithelium. This Pfam family represents a repeat domain found four times in the mammalia 24715 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 24717 7 transmembrane receptor (rhodopsin family) 24718 Reeler domain 24737 Class I Histocompatibility antigen, domains alpha 1 and 2 24747 Class I Histocompatibility antigen, domains alpha 1 and 2 24748 Class I Histocompatibility antigen, domains alpha 1 and 2 24765 Granin (chromogranin or secretogranin) 24766 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 24767 Amiloride-sensitive sodium channel 24768 Amiloride-sensitive sodium channel 24769 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 24770 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 24772 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 24773 Lectin C-type domain. This family includes both long and short form C-type 24775 Laminin G domain 24777 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 24778 Sugar (and other) transporter 24779 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 24780 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 24781 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 24782 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 24783 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 24784 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 24785 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 24785 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 24786 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 24791 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 24794 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 24795 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 24797 Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som 24800 Sulfatase 24803 Synaptobrevin 24804 Synaptophysin / synaptoporin 24807 7 transmembrane receptor (rhodopsin family) 24808 7 transmembrane receptor (rhodopsin family) 24816 7 transmembrane receptor (rhodopsin family) 24818 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 24822 Uncharacterized protein family UPF0005 24825 Transferrin 24826 Carboxylesterase 24826 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 24831 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 24831 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 24833 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 24842 P53 24848 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 24848 ACT domain. This family of domains generally have a regulatory role. ACT domains are linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to r 24849 Triosephosphate isomerase 24851 Tropomyosin 24852 Tropomyosin 24854 Clusterin 24854 Clusterin 24855 Tuberin 24855 Rap/ran-GAP 24856 Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders 24860 Mitochondrial carrier protein 24861 UDP-glucoronosyl and UDP-glucosyl transferase 24862 UDP-glucoronosyl and UDP-glucosyl transferase 24864 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 24873 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 24873 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 24874 von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA 24875 7 transmembrane receptor (Secretin family) 24875 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 24881 wnt family 24882 wnt family 24883 Wilm's tumour protein 24886 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24887 Apoptosis regulator proteins, Bcl-2 family 24888 Apoptosis regulator proteins, Bcl-2 family 24889 7 transmembrane receptor (rhodopsin family) 24890 Oestrogen receptor 24890 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 24890 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 24894 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24895 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 24896 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 24898 Sodium:neurotransmitter symporter family 24902 Sulfotransferase protein 24903 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 24904 Sugar (and other) transporter 24906 Phospholipase A2 inhibitor 24907 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 24912 Sulfotransferase protein 24912 Sulfotransferase protein 24913 NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins 24914 Lysyl oxidase 24915 LIM domain. This family represents two copies of the LIM structural domain 24915 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 24918 SH2 domain 24918 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 24918 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 24918 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 24922 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 24922 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 24923 Syntaxin 24924 PTN/MK heparin-binding protein family 24925 7 transmembrane receptor (rhodopsin family) 24929 7 transmembrane receptor (rhodopsin family) 24933 Lectin C-type domain. This family includes both long and short form C-type 24936 Tetraspanin family 24938 7 transmembrane receptor (rhodopsin family) 24941 Heavy-metal-associated domain 24941 Heavy-metal-associated domain 24942 GDP dissociation inhibitor 24943 Myelin proteolipid protein (PLP or lipophilin) 24944 Lysosome-associated membrane glycoprotein (Lamp) 24946 Trypsin 24947 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 24947 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 24948 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 24949 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 24950 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 24952 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 24953 7 transmembrane receptor (Secretin family) 24953 7 transmembrane receptor (Secretin family) 24954 Fibronectin type III domain 24954 Furin-like cysteine rich region 24954 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 24956 MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also 24957 Glutamine synthetase, catalytic domain 24962 Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OA 24967 Proteasome A-type and B-type 25006 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25006 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 25007 7 transmembrane receptor (rhodopsin family) 25008 TNF(Tumor Necrosis Factor) family 25009 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 25010 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 25011 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25014 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 25017 Myosin head (motor domain) 25021 von Willebrand factor type A domain 25022 Protein kinase domain 25023 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 25024 7 transmembrane receptor (Secretin family) 25026 Adrenomedullin 25028 Adenosine/AMP deaminase 25029 7 transmembrane receptor (Secretin family) 25031 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 25033 7 transmembrane receptor (rhodopsin family) 25035 Oxidoreductase FAD-binding domain 25040 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 25041 ATP P2X receptor 25045 Acyl CoA binding protein 25051 7 transmembrane receptor (Secretin family) 25052 Trypsin 25053 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25055 ab-hydrolase associated lipase region 25056 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 25058 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 25059 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 25060 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 25060 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF03727. Some members of the family have two copies of each of these domains 25061 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 25073 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 25075 7 transmembrane receptor (rhodopsin family) 25080 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 25081 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 25082 BRCA2 repeat. The alignment covers only the most conserved region of the repeat 25083 7 transmembrane receptor (rhodopsin family) 25085 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 25086 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25087 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 25095 Adenosine/AMP deaminase 25096 PX domain. PX domains bind to phosphoinositides 25097 PX domain. PX domains bind to phosphoinositides 25098 Fork head domain 25099 Fork head domain 25100 Fork head domain 25101 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25101 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25102 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25102 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25103 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25103 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25105 Atrial natriuretic peptide 25106 Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis 25107 7 transmembrane receptor (rhodopsin family) 25108 7 transmembrane receptor (rhodopsin family) 25111 7 transmembrane receptor (rhodopsin family) 25112 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 25116 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 25117 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 25118 von Willebrand factor type A domain 25121 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 25122 Amiloride-sensitive sodium channel 25124 SH2 domain 25124 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 25124 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 25124 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 25125 SH2 domain 25125 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 25125 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa 25125 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 25126 SH2 domain 25126 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T 25126 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA 25128 Zona pellucida-like domain 25130 Syntaxin 25132 DIL domain. The DIL domain has no known function 25136 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 25139 Sugar (and other) transporter 25146 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25147 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25149 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25149 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 25150 SH2 domain 25150 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 25151 Cation-independent mannose-6-phosphate receptor repeat. The cation-independent mannose-6-phosphate receptor contains 15 copies of a repeat 25153 7 transmembrane receptor (rhodopsin family) 25154 Progesterone receptor 25154 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25155 SH2 domain 25156 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 25156 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 25159 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d 25162 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 25165 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 25166 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 25169 Tetraspanin family 25176 Mitochondrial carrier protein 25183 GDP dissociation inhibitor 25186 Ephrin 25187 7 transmembrane receptor (rhodopsin family) 25192 Lectin C-type domain. This family includes both long and short form C-type 25193 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 25195 7 transmembrane receptor (rhodopsin family) 25196 Transforming growth factor beta like domain 25196 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 25197 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 25197 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 25202 Adenylate and Guanylate cyclase catalytic domain 25204 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 25216 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 25218 Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 25223 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 25227 Sulfatase 25229 7 transmembrane receptor (rhodopsin family) 25230 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 25231 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 25232 Fibronectin type III domain 25236 Glypican 25237 Protein kinase domain 25237 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 25238 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 25240 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 25241 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 25242 Helix-loop-helix DNA-binding domain 25243 Helix-loop-helix DNA-binding domain 25244 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 25244 Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto 25245 7 transmembrane receptor (rhodopsin family) 25245 7 transmembrane receptor (rhodopsin family) 25248 7 transmembrane receptor (rhodopsin family) 25250 Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII 25251 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25253 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 25254 Sodium:sulfate symporter transmembrane region 25256 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 25259 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 25261 Helix-loop-helix DNA-binding domain 25264 7 transmembrane receptor (rhodopsin family) 25265 7 transmembrane receptor (rhodopsin family) 25266 Platelet-derived growth factor (PDGF) 25266 Platelet-derived growth factor (PDGF) 25268 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 25268 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 25271 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25271 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 25273 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 25274 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 25275 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme contai 25277 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 25277 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 25278 Cytochrome c oxidase subunit VIa 25280 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 25281 Zn-finger in Ran binding protein and others 25283 Glutamate-cysteine ligase. This family represents the catalytic subunit of glutamate-cysteine ligase (E.C. 6.3.2.2), also known as gamma-glutamylcysteine synthetase (GCS). This enzyme catalyses the rate limiting step in the biosynthesis of glutathione. T 25285 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 25289 Adenylate and Guanylate cyclase catalytic domain 25290 Lipoxygenase 25291 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 25293 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 25294 Class I Histocompatibility antigen, domains alpha 1 and 2 25295 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 25296 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 25297 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 25298 Gastrin/cholecystokinin family 25302 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25302 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25307 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 25308 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 25311 Fibronectin type III domain 25314 PMP-22/EMP/MP20/Claudin family 25315 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 25316 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 25317 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 25319 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 25320 Gastrin/cholecystokinin family 25321 7 transmembrane receptor (Secretin family) 25322 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 25323 7 transmembrane receptor (rhodopsin family) 25324 7 transmembrane receptor (rhodopsin family) 25325 Interleukin 10 25327 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 25327 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25328 Lysosome-associated membrane glycoprotein (Lamp) 25334 Helix-loop-helix DNA-binding domain 25335 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 25336 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 25339 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 25340 7 transmembrane receptor (rhodopsin family) 25342 7 transmembrane receptor (rhodopsin family) 25343 Phosducin 25344 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 25344 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 25346 Fibronectin type III domain 25350 Lectin C-type domain. This family includes both long and short form C-type 25351 Sugar (and other) transporter 25352 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 25353 Osteopontin 25354 7 transmembrane receptor (rhodopsin family) 25355 Sulfotransferase protein 25356 Fibronectin type III domain 25359 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 25360 7 transmembrane receptor (rhodopsin family) 25364 Amiloride-sensitive sodium channel 25365 Actin 25366 Protein kinase domain 25366 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t 25367 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 25368 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es 25369 7 transmembrane receptor (rhodopsin family) 25370 7 transmembrane receptor (rhodopsin family) 25372 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 25373 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 25374 Delta-aminolevulinic acid dehydratase 25375 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 25377 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 25380 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 25382 Amyloid A4 extracellular domain 25382 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 25383 Cytidine and deoxycytidylate deaminase zinc-binding region 25385 TNF(Tumor Necrosis Factor) family 25386 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 25387 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 25387 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 25388 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 25388 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 25390 Sodium / potassium ATPase beta chain 25393 Extracellular link domain 25394 Intermediate filament protein 25397 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 25397 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 25397 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fo 25398 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25399 Cache domain 25401 OAR domain 25401 Homeobox domain 25403 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 25404 Caveolin 25405 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 25406 Extracellular link domain 25409 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 25411 Cyclic nucleotide-binding domain 25411 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25413 Choline/Carnitine o-acyltransferase 25415 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 25416 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 25416 N-acetylglucosamine-6-phosphate deacetylase. This family are predicted to adopt a TIM barrel fold. This is family of N-acetylglucosamine-6-phosphate deacetylases, EC:3.5.1.25 These enzymes catalyse the reaction N-acetyl-D-glucosamine 6-phosphate + H2O <=> 25418 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 25420 Hsp20/alpha crystallin family 25420 Alpha crystallin A chain, N terminal 25421 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 25422 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 25423 Papain family cysteine protease 25424 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 25425 Papain family cysteine protease 25426 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25427 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25428 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25429 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25430 Iodothyronine deiodinase 25431 Homeobox domain 25432 7 transmembrane receptor (rhodopsin family) 25434 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 25435 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 25437 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 25439 7 transmembrane receptor (rhodopsin family) 25440 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 25444 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 25447 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 25449 7 transmembrane receptor (rhodopsin family) 25450 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25450 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25451 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25451 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25453 Transforming growth factor beta like domain 25454 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 25456 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25456 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25457 7 transmembrane receptor (rhodopsin family) 25458 Eukaryotic glutathione synthase 25458 Eukaryotic glutathione synthase, ATP binding domain 25461 7 transmembrane receptor (rhodopsin family) 25462 Chaperonin 10 Kd subunit 25464 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 25466 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 25467 PTB domain (IRS-1 type) 25468 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 25468 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25469 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 25469 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25470 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 25470 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25471 Slow voltage-gated potassium channel 25472 Inward rectifier potassium channel 25473 Laminin N-terminal (Domain VI) 25473 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 25474 Galactoside-binding lectin 25477 7 transmembrane receptor (rhodopsin family) 25479 Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vps 25480 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 25481 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 25481 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 25483 Homeobox domain 25484 Myosin head (motor domain) 25484 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 25486 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 25489 C2 domain 25489 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 25489 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 25490 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 25491 Intermediate filament protein 25492 CTF/NF-I family 25493 Coatomer WD associated domain 25493 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 25497 RanBP1 domain 25498 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 25502 Ornithine decarboxylase antizyme 25504 Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5 25505 ATP P2X receptor 25506 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 25507 Giardia variant-specific surface protein 25507 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 25508 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 25508 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 25509 Homeobox domain 25509 'Paired box' domain 25511 Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids 25513 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 25514 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 25515 POLO box duplicated region 25516 Vertebrate endogenous opioids neuropeptide 25518 Cyclophilin type peptidyl-prolyl cis-trans isomerase 25522 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 25524 Saposin A-type domain 25525 Transforming growth factor beta like domain 25526 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 25527 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25527 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25529 Fibronectin type III domain 25534 Tetraspanin family 25538 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 25539 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 25539 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 25540 Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein 25541 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 25542 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 25544 Lectin C-type domain. This family includes both long and short form C-type 25546 Homeobox domain 25547 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 25548 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 25550 Sodium:dicarboxylate symporter family 25551 Sugar (and other) transporter 25552 Sodium:solute symporter family 25553 Sodium:neurotransmitter symporter family 25553 Serotonin (5-HT) neurotransmitter transporter, N-terminus 25555 7 transmembrane receptor (rhodopsin family) 25558 Sec1 family 25563 Trefoil (P-type) domain 25564 TSC-22/dip/bun family 25565 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 25566 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 25567 Fibronectin type III domain 25570 7 transmembrane receptor (rhodopsin family) 25571 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 25572 TNFR/NGFR cysteine-rich region 25574 HMG (high mobility group) box 25575 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 25576 14-3-3 protein 25577 14-3-3 protein 25578 14-3-3 protein 25579 Protein kinase domain 25583 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 25584 Tissue factor 25586 Alkaline phosphatase 25587 Helix-loop-helix DNA-binding domain 25588 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 25590 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25590 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25591 Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc 25591 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 25591 pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini 25592 Laminin G domain 25592 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 25592 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 25592 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 25592 Laminin G domain 25592 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 25592 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 25592 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 25593 7 transmembrane receptor (rhodopsin family) 25595 Tau and MAP protein, tubulin-binding repeat 25597 PH domain. PH stands for pleckstrin homology 25597 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 25597 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 25598 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 25600 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 25601 7 transmembrane receptor (rhodopsin family) 25604 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 25605 Orexin receptor type 2 25605 7 transmembrane receptor (rhodopsin family) 25607 Homeobox domain 25608 Leptin 25609 Tub family 25610 Interleukin-6/G-CSF/MGF family 25613 Fibronectin type III domain 25613 Eukaryotic-type carbonic anhydrase 25614 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 25615 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 25616 Insulin/IGF/Relaxin family. Superfamily includes insulins 25617 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 25619 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 25620 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 25620 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 25621 Tetraspanin family 25622 SH2 domain 25623 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 25624 Synaptobrevin 25625 TNFR/NGFR cysteine-rich region 25625 TNFR/NGFR cysteine-rich region 25626 Intermediate filament protein 25626 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 25626 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 25627 Trypsin 25629 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 25631 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 25631 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 25632 Globin 25635 7 transmembrane receptor (rhodopsin family) 25637 7 transmembrane receptor (rhodopsin family) 25638 3'5'-cyclic nucleotide phosphodiesterase 25641 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 25642 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 25643 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 25644 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 25644 Transforming growth factor beta like domain 25644 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 25645 7 transmembrane receptor (rhodopsin family) 25646 Homeobox domain 25646 Otx1 transcription factor 25650 Sodium / potassium ATPase beta chain 25651 Sushi domain (SCR repeat) 25651 Lectin C-type domain. This family includes both long and short form C-type 25652 7 transmembrane receptor (rhodopsin family) 25654 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 25655 Connexin 25656 Guanylin precursor 25657 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25659 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es 25661 Fibronectin type I domain 25661 Fibronectin type III domain 25662 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 25663 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 25664 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25664 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 25665 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25666 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 25668 ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion 25669 Zinc carboxypeptidase 25670 Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses 25671 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 25671 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 25672 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25672 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 25673 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 25674 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25674 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25675 Hydroxymethylglutaryl-coenzyme A reductase 25675 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 25676 SH2 domain 25676 PH domain. PH stands for pleckstrin homology 25676 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 25678 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 25679 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 25679 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 25681 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 25682 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25682 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 25684 MAM domain. An extracellular domain found in many receptors 25684 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 25684 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 25684 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 25685 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 25686 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 25687 Caldesmon 25688 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 25689 7 transmembrane receptor (rhodopsin family) 25690 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 25690 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 25691 Fibronectin type III domain 25691 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 25692 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 25696 Low-density lipoprotein receptor domain class A 25696 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 25698 Arginosuccinate synthase. This family contains a PP-loop motif 25699 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 25699 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 25701 Helix-loop-helix DNA-binding domain 25702 Lipase 25702 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 25703 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 25704 Sushi domain (SCR repeat) 25705 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 25706 7 transmembrane receptor (rhodopsin family) 25707 Ciliary neurotrophic factor 25708 Mitochondrial carrier protein 25711 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 25712 Interferon gamma 25713 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 25714 Helix-loop-helix DNA-binding domain 25714 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 25715 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined 25716 C2 domain 25717 Transforming growth factor beta like domain 25717 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 25718 Fibronectin type III domain 25718 Furin-like cysteine rich region 25718 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 25720 Helix-loop-helix DNA-binding domain 25722 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25723 Prepro-orexin 25724 Fibronectin type III domain 25724 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 25725 Cyclic nucleotide-binding domain 25726 7 transmembrane receptor (rhodopsin family) 25727 MAM domain. An extracellular domain found in many receptors 25727 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 25727 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 25728 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 25730 Nerve growth factor family 25731 C2 domain 25732 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 25734 SH2 domain 25734 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 25735 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25735 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 25736 Kv2 voltage-gated K+ channel 25736 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 25736 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 25737 Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 25738 SH2 domain 25738 PH domain. PH stands for pleckstrin homology 25738 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 25738 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 25739 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 25740 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 25741 Phosphofructokinase 25742 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 25743 Inward rectifier potassium channel 25744 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 25744 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 25745 Myosin head (motor domain) 25745 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 25747 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 25747 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 25748 Aminotransferase class I and II 25749 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 25750 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 25751 Dynamin family 25751 PH domain. PH stands for pleckstrin homology 25752 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 25753 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 25753 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 25756 Choline/Carnitine o-acyltransferase 25757 Choline/Carnitine o-acyltransferase 25759 SH2 domain 25759 Phosphotyrosine interaction domain (PTB/PID) 25769 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 25778 Protein kinase domain 25780 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 25780 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 25794 Fibroblast growth factor 25797 Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally im 25800 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 25802 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 25806 Homeobox domain 25807 UBA domain 25809 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 25816 Glucose inhibited division protein A 25819 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 25820 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 25821 Glucose inhibited division protein A 25821 Glucose inhibited division protein A 25823 Trypsin 25825 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 25825 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 25825 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 25828 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 25829 Domain of unknown function 25830 Sulfotransferase protein 25830 Sulfotransferase protein 25831 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 25833 Homeobox domain 25836 Caldesmon 25836 Caldesmon 25841 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 25843 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 25851 PH domain. PH stands for pleckstrin homology 25851 PH domain. PH stands for pleckstrin homology 25861 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 25862 Zn-finger in ubiquitin-hydrolases and other protein 25865 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 25870 Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown 25873 Ribosomal protein L36e 25873 Ribosomal protein L36e 25874 Uncharacterised protein family (UPF0041) 25878 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 25879 Sof1-like domain. Sof1 is essential for cell growth and is a component of the nucleolar rRNA processing machinery 25884 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 25885 RNA polymerase alpha subunit 25885 RNA polymerase A/beta'/A" subunit 25888 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 25893 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 25894 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 25897 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 25900 Intermediate filament protein 25900 Intermediate filament protein 25902 Formate--tetrahydrofolate ligase 25903 Olfactomedin-like domain 25912 6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each 25913 Telomere-binding protein alpha subunit, N-terminal domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. A 25917 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 25921 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 25923 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 25924 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 25930 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 25943 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 25947 Sugar (and other) transporter 25948 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 25948 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 25953 Metallo-beta-lactamase superfamily 25953 Metallo-beta-lactamase superfamily 25959 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 25960 7 transmembrane receptor (Secretin family) 25960 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 25966 C2 domain 25975 MAM domain. An extracellular domain found in many receptors 25976 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 25976 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 25976 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 25976 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 25978 Eukaryotic protein of unknown function, DUF279 25981 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 25983 Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown 25984 Intermediate filament protein 25984 Intermediate filament protein 25989 MIT domain 25989 Protein kinase domain 25992 Fibronectin type III domain 25992 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 26002 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 26002 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 26009 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 26013 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 26013 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 26019 Up-frameshift suppressor 2. Transcripts harboring premature signals for translation termination are recognized and rapidly degraded by eukaryotic cells through a pathway known as nonsense-mediated mRNA decay. In Saccharomyces cerevisiae, three trans-acti 26019 Up-frameshift suppressor 2. Transcripts harboring premature signals for translation termination are recognized and rapidly degraded by eukaryotic cells through a pathway known as nonsense-mediated mRNA decay. In Saccharomyces cerevisiae, three trans-acti 26031 Oxysterol-binding protein 26031 Oxysterol-binding protein 26031 Oxysterol-binding protein 26031 Oxysterol-binding protein 26031 Oxysterol-binding protein 26031 Oxysterol-binding protein 26032 Extracellular link domain 26033 Lectin C-type domain. This family includes both long and short form C-type 26033 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 26037 Rap/ran-GAP 26038 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 26043 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 26047 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 26047 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 26048 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 26048 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 26048 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 26050 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 26052 Dynamin family 26052 Dynamin GTPase effector domain 26052 PH domain. PH stands for pleckstrin homology 26054 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 26060 PH domain. PH stands for pleckstrin homology 26063 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 26088 VHS domain. Domain present in VPS-27, Hrs and STAM 26088 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 26088 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 26103 Fibronectin type III domain 26108 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 26108 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 26130 Vacuolar sorting protein 9 (VPS9) domain 26130 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 26136 LIM domain. This family represents two copies of the LIM structural domain 26136 LIM domain. This family represents two copies of the LIM structural domain 26137 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 26137 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 26140 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 26147 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 26153 Kinesin motor domain 26155 Uncharacterised protein family (UPF0120) 26156 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 26157 GTPase of unknown function 26168 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 26184 7 transmembrane receptor (rhodopsin family) 26188 7 transmembrane receptor (rhodopsin family) 26189 7 transmembrane receptor (rhodopsin family) 26205 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 26207 Phosphatidylinositol transfer protein 26207 Phosphatidylinositol transfer protein 26211 7 transmembrane receptor (rhodopsin family) 26219 7 transmembrane receptor (rhodopsin family) 26223 F-box domain 26224 F-box domain 26225 ADP-ribosylation factor family 26225 ADP-ribosylation factor family 26229 Glycosyltransferase family 43 26232 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 26234 F-box domain 26234 F-box domain 26245 7 transmembrane receptor (rhodopsin family) 26249 BTB/POZ domain. The BTB (for BR-C 26249 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Kelch is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Sev 26251 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 26251 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 26257 Homeobox domain 26259 F-box domain 26261 F-box domain 26261 F-box domain 26263 F-box domain 26263 F-box domain 26266 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 26267 F-box domain 26268 F-box domain 26268 F-box domain 26268 F-box domain 26270 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 26272 F-box domain 26272 F-box domain 26273 F-box domain 26273 F-box domain 26276 Sec1 family 26279 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 26280 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 26281 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 26285 PMP-22/EMP/MP20/Claudin family 26286 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 26289 Adenylate kinase 26289 Adenylate kinase 26290 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 26295 TPR Domain 26295 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 26301 Glycosyltransferase family 6 26333 7 transmembrane receptor (rhodopsin family) 26338 7 transmembrane receptor (rhodopsin family) 26341 7 transmembrane receptor (rhodopsin family) 26353 Hsp20/alpha crystallin family 26354 GTPase of unknown function 26358 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 26359 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 26361 7 transmembrane receptor (rhodopsin family) 26362 Fibronectin type III domain 26363 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 26364 7 transmembrane receptor (Secretin family) 26364 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 26369 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ 26372 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 26373 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 26373 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate de 26377 SH2 domain 26377 PH domain. PH stands for pleckstrin homology 26378 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 26379 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 26380 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 26380 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 26381 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 26381 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 26384 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 26385 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 26386 HSF-type DNA-binding 26388 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 26388 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 26388 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 26394 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 26399 Protein kinase domain 26403 Protein kinase domain 26403 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 26404 Protein kinase domain 26406 Protein kinase domain 26409 Protein kinase domain 26411 Protein kinase domain 26412 Protein kinase domain 26412 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 26421 Somatotropin hormone family 26422 Beige/BEACH domain 26423 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 26423 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 26424 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 26424 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 26431 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 26434 Prion/Doppel alpha-helical domain. The prion protein is thought to be the infectious agent that causes transmissible spongiform encephalopathies, such as scrapie and BSE. It is thought that the prion protein can exist in two different forms: one is the n 26440 Proteasome A-type and B-type 26445 Proteasome A-type and B-type 26446 Proteasome A-type and B-type 26447 impB/mucB/samB family. These proteins are involved in UV protection 26462 Pyridine nucleotide-disulphide oxidoreductase, dimerisation domain. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases 26462 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 26464 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 26468 Homeobox domain 26468 LIM domain. This family represents two copies of the LIM structural domain 26470 Sushi domain (SCR repeat) 26476 7 transmembrane receptor (rhodopsin family) 26492 7 transmembrane receptor (rhodopsin family) 26493 7 transmembrane receptor (rhodopsin family) 26496 7 transmembrane receptor (rhodopsin family) 26499 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 26502 pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain 26502 pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain 26508 Helix-loop-helix DNA-binding domain 26509 C2 domain 26509 C2 domain 26515 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 26517 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 26519 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 26520 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 26521 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 26523 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 26523 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 26526 Tetraspanin family 26529 7 transmembrane receptor (rhodopsin family) 26531 7 transmembrane receptor (rhodopsin family) 26532 7 transmembrane receptor (rhodopsin family) 26538 7 transmembrane receptor (rhodopsin family) 26539 7 transmembrane receptor (rhodopsin family) 26554 Cullin family 26556 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 26556 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 26557 Dor1-like family. Dor1 is involved in vesicle targeting to the yeast Golgi apparatus and complexes with a number of other trafficking proteins, which include Sec34 and Sec35 26557 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 26557 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 26558 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 26561 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 26563 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 26564 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 26565 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 26568 AMP-binding enzyme 26569 AMP-binding enzyme 26575 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 26577 CUB domain 26581 Homeobox domain 26582 Homeobox domain 26583 Homeobox domain 26584 Homeobox domain 26585 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 26658 7 transmembrane receptor (rhodopsin family) 26659 7 transmembrane receptor (rhodopsin family) 26689 7 transmembrane receptor (rhodopsin family) 26692 7 transmembrane receptor (rhodopsin family) 26696 7 transmembrane receptor (rhodopsin family) 26707 7 transmembrane receptor (rhodopsin family) 26707 7 transmembrane receptor (rhodopsin family) 26707 7 transmembrane receptor (rhodopsin family) 26716 7 transmembrane receptor (rhodopsin family) 26750 MIT domain 26750 PX domain. PX domains bind to phosphoinositides 26757 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 26759 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 26873 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 26873 Hydantoinase B/oxoprolinase. This family includes N-methylhydaintoinase B which converts hydantoin to N-carbamyl-amino acids, and 5-oxoprolinase EC:3.5.2.9 which catalyses the formation of L-glutamate from 5-oxo-L-proline. These enzymes are part of the ox 26877 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 26878 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 26879 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 26887 Sulfotransferase protein 26893 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 26903 C2 domain 26910 Helix-loop-helix DNA-binding domain 26914 Core histone H2A/H2B/H3/H4 26914 Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi 26926 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 26931 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 26932 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 26934 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 26938 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 26942 TPR Domain 26943 TMS membrane protein/tumour differentially expressed protein (TDE) 26944 Papain family cysteine protease 26946 Transient receptor 26949 Zinc-binding dehydrogenase 26955 DIL domain. The DIL domain has no known function 26955 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 26955 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 26956 Adenylate kinase 26960 Beige/BEACH domain 26962 Ribosomal Proteins L2, C-terminal domain 26962 Ribosomal Proteins L2, RNA binding domain 26965 Cullin family 26968 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 26970 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 26972 SPO11 homologue 26984 Synaptobrevin 26985 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 26986 Poly-adenylate binding protein, unique domain 26987 Eukaryotic initiation factor 4E 26992 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 26998 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 27000 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 27000 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 27010 Thiamin pyrophosphokinase, catalytic domain. Family of thiamin pyrophosphokinase (EC:2.7.6.2). Thiamin pyrophosphokinase (TPK) catalyzes the transfer of a pyrophosphate group from ATP to vitamin B1 (thiamin) to form the coenzyme thiamin pyrophosphate (TP 27012 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 27012 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 27022 Fork head domain 27023 Fork head domain 27027 Tetraspanin family 27029 Sulfatase 27033 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 27035 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 27035 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 27035 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 27044 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 27045 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 27049 Ets-domain 27052 Lipase/Acylhydrolase with GDSL-like motif 27053 Glutamine amidotransferases class-II 27053 Asparagine synthase. This family is always found associated with pfam00310. Members of this family catalyse the conversion of aspartate to asparagine 27054 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2 27055 FKBP-type peptidyl-prolyl cis-trans isomerase 27060 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 27067 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 27068 Inorganic pyrophosphatase 27068 Inorganic pyrophosphatase 27068 Inorganic pyrophosphatase 27068 Inorganic pyrophosphatase 27071 SH2 domain 27071 PH domain. PH stands for pleckstrin homology 27074 Lysosome-associated membrane glycoprotein (Lamp) 27075 Tetraspanin family 27087 Glycosyltransferase family 43 27087 Glycosyltransferase family 43 27089 pfam02939, UcrQ, UcrQ family. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This family represents the 9.5 kDa subunit of the complex 27090 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 27090 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 27090 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 27091 PMP-22/EMP/MP20/Claudin family 27091 PMP-22/EMP/MP20/Claudin family 27092 PMP-22/EMP/MP20/Claudin family 27094 Calcium-activated potassium channel, beta subunit 27094 Calcium-activated potassium channel, beta subunit 27094 Calcium-activated potassium channel, beta subunit 27094 Calcium-activated potassium channel, beta subunit 27095 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 27096 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 27102 Protein kinase domain 27103 Protein kinase domain 27103 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 27104 Protein kinase domain 27104 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 27106 Arrestin (or S-antigen) 27107 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 27115 3'5'-cyclic nucleotide phosphodiesterase 27125 Monocarboxylate transporter 27126 HMG (high mobility group) box 27128 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 27129 Hsp20/alpha crystallin family 27131 PX domain. PX domains bind to phosphoinositides 27131 PX domain. PX domains bind to phosphoinositides 27132 C2 domain 27132 C2 domain 27133 Cyclic nucleotide-binding domain 27133 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 27133 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 27133 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 27133 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 27133 Cyclic nucleotide-binding domain 27133 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 27133 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 27136 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 27139 Ribosomal protein S26e 27140 Homeobox domain 27141 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 27145 Intermediate filament protein 27145 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 27147 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 27147 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 27147 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 27150 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 27150 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 27151 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 27151 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 27154 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 27154 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 27154 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 27161 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 27161 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 27163 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 27163 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 27173 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 27176 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 27177 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 27177 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 27197 7 transmembrane receptor (rhodopsin family) 27198 7 transmembrane receptor (rhodopsin family) 27199 7 transmembrane receptor (rhodopsin family) 27201 7 transmembrane receptor (rhodopsin family) 27202 7 transmembrane receptor (rhodopsin family) 27204 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 27206 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 27207 Ribosomal protein S17 27216 7 transmembrane receptor (rhodopsin family) 27217 Homeobox domain 27223 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 27226 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 27229 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 27233 Sulfotransferase protein 27239 7 transmembrane receptor (rhodopsin family) 27243 Eukaryotic protein of unknown function, DUF279 27246 Zinc finger, C3HC4 type (RING finger) 27252 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 27255 Fibronectin type III domain 27256 Fibronectin type III domain 27257 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 27260 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 27261 PTB domain (IRS-1 type) 27263 Protein kinase domain 27263 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 27273 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 27283 Papain family cysteine protease 27284 Sulfotransferase protein 27285 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 27286 HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc 27287 Homeobox domain 27295 LIM domain. This family represents two copies of the LIM structural domain 27299 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 27299 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 27302 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 27306 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 27319 Helix-loop-helix DNA-binding domain 27324 HMG (high mobility group) box 27328 Cadherin domain 27328 Cadherin domain 27328 Cadherin domain 27328 Cadherin domain 27334 7 transmembrane receptor (rhodopsin family) 27336 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 27342 Vacuolar sorting protein 9 (VPS9) domain 27343 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 27345 Calcium-activated potassium channel, beta subunit 27350 Cytidine and deoxycytidylate deaminase zinc-binding region 27352 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 27354 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 27354 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 27359 C2 domain 27359 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 27360 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 27366 Mitosis protein DIM1 27367 Ribosomal protein L3 27369 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 27370 Ribosomal protein S26e 27372 Somatotropin hormone family 27373 Protein kinase domain 27376 Mitochondrial carrier protein 27383 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 27384 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 27385 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 27388 Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phos 27395 Ribosomal protein L15 amino terminal region. This family is always associated with pfam00256. This family is diagnostic of ribosomal L15 proteins 27398 Ribosomal Proteins L2, C-terminal domain 27399 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 27400 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 27400 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 27402 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 27402 Biotin-requiring enzyme. This alignment covers two families, the conserved lysine residue binds biotin in one group and lipoic acid in the other 27402 e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit 27402 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 27402 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 27406 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 27410 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 27411 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 27418 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 27418 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 27421 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 27421 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 27433 ATPase family associated with various cellular activities (AAA) 27436 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes 27445 C2 domain 27445 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 27643 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 27681 EAP30. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of t 27756 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 27756 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 27756 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 27967 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 27979 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 27993 IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing 28078 Somatotropin hormone family 28080 ATP synthase delta (OSCP) subunit. The ATP D subunit from E. coli is the same as the OSCP subunit which is this family. The ATP D subunit from metazoa are found in family pfam00401 28088 Uncharacterized protein family UPF0027 28105 B-box zinc finger 28185 TPR Domain 28193 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 28200 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 28231 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28231 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28232 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28232 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28234 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28234 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28240 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 28240 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 28240 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 28240 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 28240 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 28240 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 28248 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28248 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28250 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28250 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28253 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28253 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28253 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28253 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28254 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28254 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 28298 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 28316 Cadherin domain 28316 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 28511 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 28512 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 28513 Cadherin domain 28513 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 28514 Delta serrate ligand 28956 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 28966 PX domain. PX domains bind to phosphoinositides 28968 Sodium:neurotransmitter symporter family 28978 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 28983 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 28986 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 29065 Lipid-A-disaccharide synthetase. This is a family of lipid-A-disaccharide synthetases 29068 BTB/POZ domain 29072 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 29072 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 29074 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 29095 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 29097 Cornichon protein 29103 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 29108 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 29108 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 29108 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 29116 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 29117 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 29119 Vinculin family 29124 Galactoside-binding lectin 29127 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 29128 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 29128 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 29128 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 29130 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 29135 Trypsin 29139 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 29141 Galanin 29144 Calreticulin family 29146 Delta serrate ligand 29147 Delta serrate ligand 29148 Helix-loop-helix DNA-binding domain 29148 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 29152 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 29153 Calpain family cysteine protease 29153 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 29154 Calpain family cysteine protease 29154 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 29155 Calpain family cysteine protease 29155 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 29157 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 29160 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m 29161 Caveolin 29162 Cadherin domain 29162 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 29163 Cadherin domain 29163 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 29165 Trypsin 29167 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 29168 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 29169 Somatomedin B domain 29169 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 29170 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 29172 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 29173 Casein 29179 Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown 29181 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 29182 Cadherin domain 29184 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 29185 Tetraspanin family 29186 Tetraspanin family 29188 Kappa casein. Kappa-casein is a mammalian milk protein involved in a number of important physiological processes. In the gut, the ingested protein is split into an insoluble peptide (para kappa-casein) and a soluble hydrophilic glycopeptide (caseinomacro 29190 Vertebrate endogenous opioids neuropeptide 29191 Neurokinin B 29192 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 29194 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 29197 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 29199 Class I Histocompatibility antigen, domains alpha 1 and 2 29200 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 29202 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 29203 Fork head domain 29205 Synaptogyrin. This family of proteins is distantly related to pfam01284 29209 Calsequestrin 29210 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 29211 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 29215 Arginase family 29216 Low-density lipoprotein receptor domain class A 29216 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 29219 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 29221 Arginase family 29223 Adenylate kinase 29224 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 29225 Carboxylesterase 29227 CTF/NF-I family 29228 CTF/NF-I family 29231 WD domain, G-beta repeat 29232 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 29233 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 29233 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate de 29234 7 transmembrane receptor (rhodopsin family) 29235 7 transmembrane receptor (rhodopsin family) 29236 Ribosomal protein S2 29237 Vertebrate endogenous opioids neuropeptide 29238 7 transmembrane receptor (rhodopsin family) 29241 Adenylate and Guanylate cyclase catalytic domain 29242 Eukaryotic-type carbonic anhydrase 29243 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 29246 Stathmin family 29248 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 29251 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 29251 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 29252 Trypsin 29253 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 29256 7 transmembrane receptor (rhodopsin family) 29257 Ribosomal protein L6 29258 Ribosomal protein S7e 29259 Lectin C-type domain. This family includes both long and short form C-type 29260 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 29261 Thymidylate synthase 29264 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 29264 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 29270 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 29272 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 29276 LIM domain. This family represents two copies of the LIM structural domain 29277 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 29278 LIM domain. This family represents two copies of the LIM structural domain 29279 Homeobox domain 29282 Ribosomal protein L14p/L23e 29283 Ribosomal L29e protein family 29286 Ribosomal S17 29287 Ribosomal protein S19e 29288 Ribosomal S3Ae family 29290 7 transmembrane receptor (rhodopsin family) 29292 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 29295 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 29296 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 29298 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 29301 Phenylalanine and histidine ammonia-lyase 29302 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 29304 Ribosomal protein S6e 29307 7 transmembrane receptor (rhodopsin family) 29308 ADP-ribosylation factor family 29310 7 transmembrane receptor (rhodopsin family) 29313 Lectin C-type domain. This family includes both long and short form C-type 29316 7 transmembrane receptor (rhodopsin family) 29317 LIM domain. This family represents two copies of the LIM structural domain 29318 Macrophage migration inhibitory factor (MIF) 29319 Eukaryotic cobalamin-binding protein 29321 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 29322 C2 domain 29322 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 29322 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 29323 Sodium:neurotransmitter symporter family 29324 F-actin capping protein alpha subunit 29331 Extracellular link domain 29332 Stathmin family 29334 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 29335 7 transmembrane receptor (rhodopsin family) 29337 SH2 domain 29337 PH domain. PH stands for pleckstrin homology 29337 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 29340 Protein kinase domain 29340 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 29340 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 29345 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 29347 Amidase 29348 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 29349 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 29352 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 29355 Hr1 repeat 29356 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 29357 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 29357 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 29358 7 transmembrane receptor (rhodopsin family) 29358 7TM chemoreceptor. This large family of proteins are related to pfam00001. They are 7 transmembrane receptors. This family does not include all known members, as there are problems with overlapping specificity with pfam00001. This family is greatly expand 29359 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 29361 HMG (high mobility group) box 29363 7 transmembrane receptor (rhodopsin family) 29366 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 29367 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 29368 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 29369 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 29371 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 29373 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 29374 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 29374 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 29375 Homeobox domain 29381 Protein kinase domain 29381 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t 29383 Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the ty 29385 7 transmembrane receptor (rhodopsin family) 29388 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 29389 Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom 29392 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 29394 GCM motif protein 29395 HMG (high mobility group) box 29403 Hepatic lectin, N-terminal domain 29403 Lectin C-type domain. This family includes both long and short form C-type 29410 Helix-loop-helix DNA-binding domain 29411 Palmitoyl protein thioesterase 29412 7 transmembrane receptor (rhodopsin family) 29413 7 transmembrane receptor (rhodopsin family) 29415 7 transmembrane receptor (rhodopsin family) 29421 Uroporphyrinogen decarboxylase (URO-D) 29422 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 29422 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 29423 IQ calmodulin-binding motif. Calmodulin-binding motif 29425 Proteasome A-type and B-type 29426 Ribosomal family S4e 29431 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 29432 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 29433 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 29434 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 29437 Actin 29438 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 29440 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 29443 S-adenosyl-L-homocysteine hydrolase 29445 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 29446 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 29449 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 29454 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 29458 Helix-loop-helix DNA-binding domain 29460 Protein kinase domain 29462 7 transmembrane receptor (rhodopsin family) 29464 Sodium:neurotransmitter symporter family 29465 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 29466 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 29468 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 29469 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 29471 7 transmembrane receptor (rhodopsin family) 29475 Iodothyronine deiodinase 29476 Cysteine rich repeat. This cysteine rich repeat contains four cysteines. It is found in multiple copies in a protein that binds to fibroblast growth factors. The repeat is also found in MG160 and E-selectin ligand (ESL-1) 29477 Tau and MAP protein, tubulin-binding repeat 29480 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 29482 Sodium:dicarboxylate symporter family 29483 Sodium:dicarboxylate symporter family 29489 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 29492 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 29492 Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved 29496 Furin-like cysteine rich region 29496 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 29497 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 29498 6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. 29499 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 29499 Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation 29500 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 29501 K-Cl Co-transporter type 1 (KCC1) 29502 Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor 29503 Sugar (and other) transporter 29504 Sugar (and other) transporter 29507 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 29509 Sugar (and other) transporter 29511 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 29512 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 29515 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 29516 3'5'-cyclic nucleotide phosphodiesterase 29520 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 29521 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cy 29524 LIM domain. This family represents two copies of the LIM structural domain 29525 Phosphatidylinositol transfer protein 29526 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 29527 Animal haem peroxidase 29527 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 29528 Synaptobrevin 29530 Protein kinase domain 29530 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 29533 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 29534 Zinc finger, C3HC4 type (RING finger) 29535 Homeobox domain 29539 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 29540 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 29541 HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a 29542 Phosphatidylethanolamine-binding protein 29543 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 29544 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 29545 Ubiquitin carboxyl-terminal hydrolase, family 1 29546 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 29547 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 29548 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 29552 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 29553 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29555 7 transmembrane receptor (Secretin family) 29556 Myosin head (motor domain) 29557 Myosin head (motor domain) 29558 Class I Histocompatibility antigen, domains alpha 1 and 2 29559 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 29563 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 29566 Gamma-glutamyltranspeptidase 29567 Helix-loop-helix DNA-binding domain 29569 CUB domain 29571 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29577 Helix-loop-helix DNA-binding domain 29580 Squalene/phytoene synthase 29581 7 transmembrane receptor (rhodopsin family) 29584 Connexin 29585 Connexin 29586 Connexin 29591 Protein kinase domain 29591 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 29592 Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci 29593 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 29593 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 29595 7 transmembrane receptor (rhodopsin family) 29597 7 transmembrane receptor (rhodopsin family) 29600 E1-E2 ATPase 29600 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 29600 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 29600 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 29601 Connexin 29601 Gap junction alpha-8 protein (Cx50) 29603 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 29603 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 29605 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 29606 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 29609 OAR domain 29609 Homeobox domain 29610 Zona pellucida-like domain 29611 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 29614 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 29614 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 29615 Fibronectin type III domain 29616 Protein-tyrosine phosphatase 29617 Fibronectin type III domain 29618 BTG1 family. A novel family of anti-proliferative proteins 29619 BTG1 family. A novel family of anti-proliferative proteins 29622 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 29623 UDP-glucoronosyl and UDP-glucosyl transferase 29623 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 29624 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 29627 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 29628 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 29629 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 29630 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b 29630 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin 29632 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 29633 Sulfotransferase protein 29635 Mitochondrial import inner membrane, translocase subunit TIM44. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids o 29637 Hydroxymethylglutaryl-coenzyme A synthase 29639 ATP1G1/PLM/MAT8 family 29639 ATP1G1/PLM/MAT8 family 29641 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 29641 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 29642 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 29642 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 29643 Sugar (and other) transporter 29647 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 29647 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 29648 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 29651 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 29657 Helix-loop-helix DNA-binding domain 29659 ATP P2X receptor 29662 GDP dissociation inhibitor 29663 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 29663 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 29665 ATP P2X receptor 29667 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 29674 Proteasome A-type and B-type 29675 Proteasome A-type and B-type 29676 Proteasome A-type and B-type 29678 SART-1 family. This family of proteins appear to contain a leucine zipper and may therefore be a family of transcription factors 29680 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 29681 Mitochondrial glycoprotein. This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q. It is thought to be involved in mitochondrial oxidative phosphorylation and in nucleus-mitochondrion inte 29682 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 29682 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29683 Lectin C-type domain. This family includes both long and short form C-type 29685 MCM2/3/5 family 29685 Sigma-54 interaction domain 29686 Immunoglobulin domain 29688 Histidine acid phosphatase 29689 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 29689 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 29691 3'5'-cyclic nucleotide phosphodiesterase 29694 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 29694 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 29695 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 29695 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 29699 Cyclic nucleotide-binding domain 29699 Cyclic nucleotide-binding domain 29700 Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha) 29701 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29707 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 29707 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 29708 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 29708 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 29710 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29711 Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site 29711 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 29712 Inward rectifier potassium channel 29713 Inward rectifier potassium channel 29715 Calx-beta domain 29715 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 29716 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29717 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29718 Inward rectifier potassium channel 29719 Inward rectifier potassium channel 29721 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 29722 Phosphotyrosine interaction domain (PTB/PID) 29722 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 29723 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog 29726 Sugar (and other) transporter 29731 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 29731 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29733 7 transmembrane receptor (rhodopsin family) 29734 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 29740 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 29743 Mitochondrial carrier protein 29747 Calcium-activated potassium channel, beta subunit 29750 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 29751 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 29752 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 29753 14-3-3 protein 29754 ATP synthase subunit C 29757 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 29758 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 29760 Occludin/ELL family 29761 Ubiquitin carboxyl-terminal hydrolase family 2 29765 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 29766 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 29767 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 29780 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 29785 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 29799 Yippee putative zinc-binding protein 29800 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 29801 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 29807 Thiamin pyrophosphokinase, catalytic domain. Family of thiamin pyrophosphokinase (EC:2.7.6.2). Thiamin pyrophosphokinase (TPK) catalyzes the transfer of a pyrophosphate group from ATP to vitamin B1 (thiamin) to form the coenzyme thiamin pyrophosphate (TP 29809 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 29810 BAG domain. Domain present in Hsp70 regulators 29811 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 29812 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 29818 Hsp20/alpha crystallin family 29819 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 29842 Sterile alpha motif (SAM)/Pointed domain 29845 7 transmembrane receptor (rhodopsin family) 29845 7 transmembrane receptor (rhodopsin family) 29845 7 transmembrane receptor (rhodopsin family) 29846 7 transmembrane receptor (rhodopsin family) 29846 7 transmembrane receptor (rhodopsin family) 29847 7 transmembrane receptor (rhodopsin family) 29847 7 transmembrane receptor (rhodopsin family) 29849 7 transmembrane receptor (rhodopsin family) 29849 7 transmembrane receptor (rhodopsin family) 29849 7 transmembrane receptor (rhodopsin family) 29850 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 29856 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 29858 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 29858 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 29859 Sulfotransferase protein 29861 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 29861 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 29863 3'5'-cyclic nucleotide phosphodiesterase 29871 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 29877 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 29880 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 29881 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 29882 Cullin family 29884 Apoptosis regulator proteins, Bcl-2 family 29886 PX domain. PX domains bind to phosphoinositides 29887 PX domain. PX domains bind to phosphoinositides 29889 GTPase of unknown function 29890 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 29894 CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p 29901 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 29904 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 29906 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 29907 MIT domain 29907 PX domain. PX domains bind to phosphoinositides 29907 PX domain. PX domains bind to phosphoinositides 29909 7 transmembrane receptor (rhodopsin family) 29911 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB 29916 PX domain. PX domains bind to phosphoinositides 29916 PX domain. PX domains bind to phosphoinositides 29920 Delta 1-pyrroline-5-carboxylate reductase 29922 Nucleoside diphosphate kinases 29928 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim 29929 ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum 29933 7 transmembrane receptor (rhodopsin family) 29934 PX domain. PX domains bind to phosphoinositides 29937 Steroid binding domain 29941 Hr1 repeat 29943 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 29952 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 29953 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 29956 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 29956 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 29956 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 29957 Mitochondrial carrier protein 29957 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 29958 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 29964 LIM domain. This family represents two copies of the LIM structural domain 29967 CUB domain 29978 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 29979 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 29979 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 29985 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 29986 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 29988 Sugar (and other) transporter 29989 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 29991 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 29994 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 29995 LIM domain. This family represents two copies of the LIM structural domain 30001 Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER 30012 Homeobox domain 30044 7 transmembrane receptor (rhodopsin family) 30045 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 30053 Lectin C-type domain. This family includes both long and short form C-type 30055 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 30056 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 30057 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 30058 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 30059 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 30060 Transferrin 30062 OAR domain 30062 Homeobox domain 30794 LIM domain. This family represents two copies of the LIM structural domain 30794 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 30795 FKBP-type peptidyl-prolyl cis-trans isomerase 30800 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 30800 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 30805 Sugar (and other) transporter 30806 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 30806 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 30812 HMG (high mobility group) box 30813 Homeobox domain 30814 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with 30815 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 30816 ENV polyprotein (coat polyprotein) 30817 7 transmembrane receptor (Secretin family) 30817 7 transmembrane receptor (Secretin family) 30817 7 transmembrane receptor (Secretin family) 30817 7 transmembrane receptor (Secretin family) 30817 7 transmembrane receptor (Secretin family) 30817 7 transmembrane receptor (Secretin family) 30817 7 transmembrane receptor (Secretin family) 30827 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 30832 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 30837 SH2 domain 30841 F-box domain 30841 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 30843 F-box domain 30877 GTPase of unknown function 30923 Fork head domain 30924 Uncharacterized ACR, COG1579 30924 Fibrinogen beta and gamma chains, C-terminal globular domain 30928 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 30930 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp 30937 LIM domain. This family represents two copies of the LIM structural domain 30940 Ubiquitin carboxyl-terminal hydrolase family 2 30942 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 30942 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 30945 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 30945 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 30948 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 30951 'chromo' (CHRromatin Organization MOdifier) domain 30952 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 30955 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 30955 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 30956 Alanine dehydrogenase/pyridine nucleotide transhydrogenase. This family now also contains the lysine 2-oxoglutarate reductases 30956 Saccharopine dehydrogenase. This family comprised of three structural domains that can not be separated in the linear sequence. In some organisms this enzyme is found as a bifunctional polypeptide with lysine ketoglutarate reductase (PF). The saccharopin 30957 Intermediate filament protein 30960 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 30968 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 49854 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 49860 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 50484 Ribonucleotide reductase, small chain 50484 Ribonucleotide reductase, small chain 50485 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 50490 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 50496 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 50497 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 50500 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 50500 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 50506 Animal haem peroxidase 50506 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 50507 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 50508 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 50515 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 50522 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 50527 Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER 50528 Trypsin 50529 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 50540 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 50545 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 50549 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 50554 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 50554 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 50555 UDP-glucoronosyl and UDP-glucosyl transferase 50556 Exocyst complex subunit Sec15-like 50560 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 50563 Connexin 50564 Connexin 50566 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 50570 Ribosomal protein L5 50570 PH domain. PH stands for pleckstrin homology 50570 ribosomal L5P family C-terminus. This region is found associated with pfam00281 50572 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 50572 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 50577 7 transmembrane receptor (rhodopsin family) 50592 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 50594 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 50599 Inward rectifier potassium channel 50604 Interleukin 10 50613 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 50614 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 50616 Interleukin 10 50617 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 50617 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 50617 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 50621 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 50622 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 50640 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of 50645 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 50646 Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta 50654 Papain family cysteine protease 50659 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 50659 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 50662 Runt domain 50664 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 50665 Thymosin beta-4 family 50669 C2 domain 50669 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 50671 Zinc-binding dehydrogenase 50671 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 50671 Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid th 50672 7 transmembrane receptor (rhodopsin family) 50672 7 transmembrane receptor (rhodopsin family) 50674 Helix-loop-helix DNA-binding domain 50676 Sodium:neurotransmitter symporter family 50677 Fibronectin type III domain 50678 3'5'-cyclic nucleotide phosphodiesterase 50681 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 50682 ab-hydrolase associated lipase region 50687 Fibronectin type III domain 50688 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 50690 Sodium:neurotransmitter symporter family 50694 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 50702 Sushi domain (SCR repeat) 50706 Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria 50708 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 50709 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 50721 Sir2 family 50755 F-box domain 50755 UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near 50759 F-box domain 50760 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 50762 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 50765 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 50765 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 50766 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 50767 Cyclic nucleotide-binding domain 50776 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 50779 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 50779 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 50779 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 50779 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 50779 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 50780 C2 domain 50782 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 50782 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 50782 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 50784 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 50786 FlhB HrpN YscU SpaS Family 50789 F-box domain 50789 F-box domain 50791 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 50791 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 50791 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 50797 Coatomer WD associated domain 50797 WD domain, G-beta repeat 50797 Coatomer WD associated domain 50798 ROK family 50798 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 50798 UDP-N-acetylglucosamine 2-epimerase. This family consists of UDP-N-acetylglucosamine 2-epimerases EC:5.1.3.14 this enzyme catalyses the production of UDP-ManNAc from UDP-GlcNAc. Note that some of the enzymes is this family are bifunctional, in this instan 50799 Mitochondrial carrier protein 50801 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 50808 Adenylate kinase 50810 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 50814 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 50846 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 50846 Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation 50850 MIT domain 50850 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 50859 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 50861 Stathmin family 50868 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 50873 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 50873 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 50874 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 50875 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 50876 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 50878 Stromal antigen (SA/STAG) protein 50908 CUB domain 50908 Sushi domain (SCR repeat) 50909 CUB domain 50912 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi 50912 HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease 50912 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin 50913 Helix-loop-helix DNA-binding domain 50914 Helix-loop-helix DNA-binding domain 50917 Sulfatase 50929 Interleukin 10 50932 Protein kinase domain 50932 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 50933 Ubiquitin carboxyl-terminal hydrolase, family 1 50933 Ubiquitin carboxyl-terminal hydrolase, family 1 50935 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 50937 Fibronectin type III domain 50938 Fibronectin type III domain 50939 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 50940 3'5'-cyclic nucleotide phosphodiesterase 50944 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 50964 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 50993 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 50995 Repeat in ubiquitin-activating (UBA) protein 50995 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 50997 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 50999 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 51001 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 51014 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 51015 Isochorismatase family. This family are hydrolase enzymes 51016 Uncharacterised protein family (UPF0172) 51025 Uncharacterised protein family (UPF0108) 51026 Got1-like family. Traffic through the yeast Golgi complex depends on a member of the syntaxin family of SNARE proteins, Sed5, present in early Golgi cisternae. Got1 is thought to facilitate Sed5-dependent fusion events 51028 Vacuolar protein sorting 36. Vps36 is involved in Golgi to endosome trafficking 51031 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 51032 Trypsin 51043 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 51046 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 51053 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either asso 51058 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 51061 Intermediate filament protein 51061 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 51062 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 51062 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 51064 2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HC 51065 Ribosomal protein S27 51072 Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown 51074 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 51075 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 51082 RNA polymerases L / 13 to 16 kDa subunit 51084 3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin 51085 Helix-loop-helix DNA-binding domain 51085 Helix-loop-helix DNA-binding domain 51087 'Cold-shock' DNA-binding domain 51094 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 51109 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51118 Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This 51119 Uncharacterized protein family UPF0023 51127 B-box zinc finger 51127 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 51130 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 51130 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 51133 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 51135 Protein kinase domain 51148 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va 51156 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 51157 Zinc finger 51160 VPS28 protein 51163 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 51167 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 51170 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51171 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51176 HMG (high mobility group) box 51179 FMN-dependent dehydrogenase 51181 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51182 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 51184 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 51196 C2 domain 51200 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 51205 Histidine acid phosphatase 51208 PMP-22/EMP/MP20/Claudin family 51209 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 51226 Clathrin adaptor complex small chain 51230 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 51231 Protein kinase domain 51232 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 51232 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 51236 Domain of unknown function (DUF383). domo keywords :chromosome c26f1.12c 41.3 51236 Domain of unknown function (DUF384). domo keywords :chromosome c26f1.12c 41.3 51240 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 51270 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 51276 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 51277 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 51281 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 51284 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 51289 7 transmembrane receptor (rhodopsin family) 51292 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 51296 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 51300 SURF4 family 51301 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 51302 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 51305 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 51308 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 51310 Sugar (and other) transporter 51310 Sugar (and other) transporter 51311 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 51311 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 51312 Mitochondrial carrier protein 51312 Mitochondrial carrier protein 51314 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 51317 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 51332 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 51338 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 51338 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 51340 HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins 51341 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 51350 Intermediate filament protein 51360 Peptidase family M50 51363 Sulfotransferase protein 51363 Sulfotransferase protein 51364 MYND finger 51365 Lipase 51367 Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40 51375 PX domain. PX domains bind to phosphoinositides 51375 PX domain. PX domains bind to phosphoinositides 51377 Ubiquitin carboxyl-terminal hydrolase, family 1 51380 Pyridoxal-dependent decarboxylase conserved domain 51382 ATP synthase subunit D. This is a family of subunit D form various ATP synthases including V-type H+ transporting and Na+ dependent. Subunit D is suggested to be an integral part of the catalytic sector of the V-ATPase 51384 wnt family 51384 wnt family 51385 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 51398 Uncharacterised protein family (UPF0139) 51399 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 51402 HSF-type DNA-binding 51411 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 51411 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 51429 PX domain. PX domains bind to phosphoinositides 51438 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 51441 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 51447 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 51449 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 51450 Homeobox domain 51451 Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence 51451 Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence 51454 Phosphotyrosine interaction domain (PTB/PID) 51454 Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome 51455 impB/mucB/samB family. These proteins are involved in UV protection 51455 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 51458 Ammonium Transporter Family 51465 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 51465 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 51466 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 51471 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 51474 LIM domain. This family represents two copies of the LIM structural domain 51477 Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play 51478 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51479 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 51479 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 51493 Uncharacterized protein family UPF0027 51496 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 51504 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 51522 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 51523 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 51554 7 transmembrane receptor (rhodopsin family) 51554 7 transmembrane receptor (rhodopsin family) 51559 haloacid dehalogenase-like hydrolase 51560 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 51567 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 51569 Uncharacterised protein family (UPF0185). This family contains a number of small uncharacterised proteins including BM-002 51573 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 51580 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 51582 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 51582 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 51588 pfam02891, zf-MIZ, MIZ zinc finger 51592 B-box zinc finger 51592 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 51592 B-box zinc finger 51592 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 51596 CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide var 51604 Gpi16 subunit, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to ne 51605 Eukaryotic initiation factor 3, gamma subunit. eIF-3 is a multisubunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3 51606 Adaptin N terminal region 51608 Protein of unknown function (DUF410). Family of conserved eukaryotic proteins with undetermined function 51616 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tra 51629 Mitochondrial carrier protein 51631 Protein of unknown function, DUF259 51634 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 51646 Yippee putative zinc-binding protein 51652 Eukaryotic protein of unknown function, DUF279 51657 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 51659 Domain of unknown function (DUF392). A eukaryotic specific domain of undetermined function.` 51660 Uncharacterised protein family (UPF0041) 51661 FKBP-type peptidyl-prolyl cis-trans isomerase 51661 FKBP-type peptidyl-prolyl cis-trans isomerase 51665 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 51666 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 51666 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 51676 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 51678 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 51678 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 51686 Ornithine decarboxylase antizyme 51691 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 51700 Oxidoreductase FAD-binding domain 51702 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 51704 7 transmembrane receptor (metabotropic glutamate family) 51706 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 51710 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 51715 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 51716 Carboxylesterase 51719 Mo25 protein family 51727 Adenylate kinase 51733 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 51734 SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four deca 51741 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51741 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51741 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51741 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 51742 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 51742 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 51747 Protein of unknown function, DUF259 51752 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 51760 C2 domain 51762 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 51773 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 51776 Protein kinase domain 51780 jmjC domain 51789 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 51791 Raf-like Ras-binding domain 51792 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 51793 Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzyme 51795 HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc 51796 PWI domain 51799 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 51800 Apoptosis regulator proteins, Bcl-2 family 51802 Amiloride-sensitive sodium channel 51802 Amiloride-sensitive sodium channel 51807 Tubulin/FtsZ family 51809 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 51810 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 51811 Lectin C-type domain. This family includes both long and short form C-type 51812 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 51816 Adenosine/AMP deaminase 51816 Adenosine/AMP deaminase 51885 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 51960 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 51960 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 52024 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 52033 Protein kinase domain 52040 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 52064 ubiE/COQ5 methyltransferase family 52250 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 52323 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 52323 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 52331 Starch binding domain 52331 Starch binding domain 52389 7 transmembrane receptor (Secretin family) 52428 Hr1 repeat 52428 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 52443 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 52468 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 52530 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 52538 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 52552 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 52563 Anaphase promoting complex subunit 8 / cdc23. The anaphase-promoting complex is composed of eight protein subunits, including BimE (APC1), CDC27 (APC3), CDC16 (APC6), and CDC23 (APC8) 52575 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 52585 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 52588 Tetraspanin family 52609 'chromo' (CHRromatin Organization MOdifier) domain 52614 7 transmembrane receptor (Secretin family) 52614 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 52633 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 52635 C2 domain 52653 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 52679 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 52679 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 52685 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 52700 Thioredoxin 52822 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 52837 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 52850 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 52857 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 52858 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 52858 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 52906 WD domain, G-beta repeat 52906 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 52915 MIZ zinc finger 52915 pfam02891, zf-MIZ, MIZ zinc finger 53315 Sulfotransferase protein 53318 LIM domain. This family represents two copies of the LIM structural domain 53319 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 53319 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl 53320 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 53320 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 53321 Fibrinogen beta and gamma chains, C-terminal globular domain 53321 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 53328 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 53330 Synaptobrevin 53335 Zinc finger, C2H2 type 53339 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 53345 Extracellular link domain 53349 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 53349 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 53349 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 53349 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 53349 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 53353 Low-density lipoprotein receptor domain class A 53354 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 53354 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 53354 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 53357 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 53358 Phosphotyrosine interaction domain (PTB/PID) 53373 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 53382 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 53412 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 53412 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 53413 Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in S. cerevisiae , Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In S. cerevisiae, the exocys 53418 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 53419 Low-density lipoprotein receptor domain class A 53419 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 53420 C2 domain 53424 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 53605 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 53606 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 53606 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 53607 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 53610 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 53615 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha 53616 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 53616 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 53616 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 53616 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 53616 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 53616 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 53616 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 53616 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 53616 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 53616 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 53617 Intermediate filament protein 53620 Synaptobrevin 53622 Intermediate filament protein 53623 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 53624 PMP-22/EMP/MP20/Claudin family 53625 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 53627 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 53627 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 53627 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 53627 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 53630 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc 53637 7 transmembrane receptor (rhodopsin family) 53791 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 53814 Ornithine decarboxylase antizyme 53817 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 53817 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 53817 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 53817 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 53822 ATP1G1/PLM/MAT8 family 53826 ATP1G1/PLM/MAT8 family 53827 ATP1G1/PLM/MAT8 family 53829 7 transmembrane receptor (rhodopsin family) 53829 7 transmembrane receptor (rhodopsin family) 53831 7 transmembrane receptor (rhodopsin family) 53832 Tissue factor 53836 7 transmembrane receptor (rhodopsin family) 53840 B-box zinc finger 53840 Zinc finger, C3HC4 type (RING finger) 53840 B-box zinc finger 53840 Zinc finger, C3HC4 type (RING finger) 53860 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 53861 Zn-finger in Ran binding protein and others 53867 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 53870 Fibronectin type III domain 53871 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 53881 Sodium:solute symporter family 53883 7 transmembrane receptor (Secretin family) 53883 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 53883 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 53885 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 53893 NUDIX domain 53901 RNA recognition motif. (a.k.a. RRM 53905 Animal haem peroxidase 53905 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 53905 Animal haem peroxidase 53905 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 53917 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 53918 eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known 53919 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 53919 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 53938 Cyclophilin type peptidyl-prolyl cis-trans isomerase 53940 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 53942 Fibronectin type III domain 53942 Fibronectin type III domain 53944 Protein kinase domain 53949 7 transmembrane receptor (rhodopsin family) 53950 Somatotropin hormone family 53970 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 53973 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 53978 7 transmembrane receptor (rhodopsin family) 53978 7 transmembrane receptor (rhodopsin family) 53981 Metallo-beta-lactamase superfamily 54003 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 54006 MYND finger 54006 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 54059 Uncharacterized protein family UPF0054 54084 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 54101 Protein kinase domain 54101 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 54106 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 54106 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 54112 7 transmembrane receptor (rhodopsin family) 54124 Cyclin-dependent kinase regulatory subunit 54125 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 54127 Ribosomal protein S28e 54128 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 54132 LIM domain. This family represents two copies of the LIM structural domain 54132 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 54133 LIM domain. This family represents two copies of the LIM structural domain 54133 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 54140 7 transmembrane receptor (rhodopsin family) 54145 Core histone H2A/H2B/H3/H4 54150 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 54151 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 54152 Dynein light chain type 1 54159 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 54165 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 54169 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 54189 Rabaptin 54190 Rabaptin 54192 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 54193 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 54195 Adenylate and Guanylate cyclase catalytic domain 54199 7 transmembrane receptor (rhodopsin family) 54200 Sulfotransferase protein 54204 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 54207 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 54207 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 54207 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 54214 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 54216 Cadherin domain 54217 Ribosomal protein L36e 54218 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 54223 Adenylate and Guanylate cyclase catalytic domain 54226 Beta-amyloid peptide (beta-APP) 54226 Beta-amyloid peptide (beta-APP) 54230 BTG1 family. A novel family of anti-proliferative proteins 54231 Eukaryotic-type carbonic anhydrase 54232 Eukaryotic-type carbonic anhydrase 54233 Eukaryotic-type carbonic anhydrase 54234 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 54236 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 54239 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 54239 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 54240 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 54240 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 54241 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 54241 Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each le 54243 Sushi domain (SCR repeat) 54245 SH2 domain 54246 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 54248 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 54248 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 54249 Trypsin 54250 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 54256 Connexin 54257 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 54258 7 transmembrane receptor (rhodopsin family) 54260 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 54261 Calcium-activated SK potassium channel 54261 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 54262 Calcium-activated SK potassium channel 54262 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 54263 Calcium-activated SK potassium channel 54263 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 54264 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 54264 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 54267 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 54269 Trypsin 54274 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 54276 Helix-loop-helix DNA-binding domain 54278 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 54278 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 54279 Fibronectin type III domain 54280 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 54281 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 54282 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 54283 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 54283 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 54284 Homeobox domain 54286 Protein kinase domain 54290 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 54292 Raf-like Ras-binding domain 54292 Phosphotyrosine interaction domain (PTB/PID) 54293 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 54295 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 54295 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 54295 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 54296 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 54296 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 54301 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 54302 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 54302 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 54304 Single-strand binding protein family 54305 7 transmembrane receptor (rhodopsin family) 54309 C2 domain 54311 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim 54314 Animal haem peroxidase 54315 Mitochondrial carrier protein 54319 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 54319 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 54321 Calponin family repeat 54322 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 54323 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 54324 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 54324 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 54325 GNS1/SUR4 family 54326 GNS1/SUR4 family 54328 7 transmembrane receptor (rhodopsin family) 54329 7 transmembrane receptor (rhodopsin family) 54338 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 54345 HMG (high mobility group) box 54349 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 54350 Sec1 family 54354 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 54355 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 54357 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 54361 wnt family 54363 FMN-dependent dehydrogenase 54364 RNase P subunit p30. This protein is part of the RNase P complex that is involved in tRNA maturation 54366 Vinculin family 54368 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 54371 Sulfotransferase protein 54373 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 54375 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 54376 PMP-22/EMP/MP20/Claudin family 54377 PMP-22/EMP/MP20/Claudin family 54381 Peptidase family M28D 54381 Peptidase family M28D 54383 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 54387 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 54391 Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-pho 54392 Domain of unknown function (DUF315). Family of plant hypothetical proteins 54392 Chromosome condensation protein 3, C-terminal region. Cnd3 is a Member of the five subunit condensin complex. Each subunit is essential for mitotic condensation 54393 7 transmembrane receptor (metabotropic glutamate family) 54393 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 54397 Palmitoyl protein thioesterase 54398 Palmitoyl protein thioesterase 54401 14-3-3 protein 54403 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 54403 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 54407 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 54410 Somatomedin B domain 54410 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 54413 Carboxylesterase 54419 PMP-22/EMP/MP20/Claudin family 54420 PMP-22/EMP/MP20/Claudin family 54422 Homeobox domain 54426 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 54431 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 54434 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 54437 Thrombospondin type 1 domain 54437 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 54437 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 54439 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 54439 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 54441 Stromal antigen (SA/STAG) protein 54442 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 54453 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 54454 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 54454 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 54455 F-box domain 54455 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 54464 Putative 5'-3' exonuclease domain. This family aligns residues towards the N-terminus of several proteins with multiple functions. The members of this family all appear to possess 5'-3' exonuclease activity EC:3.1.11.-. Thus, the aligned region may be nec 54466 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members 54467 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 54467 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 54469 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i 54471 Mab-21 protein 54474 Intermediate filament protein 54480 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 54483 B-box zinc finger 54483 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 54485 Delta serrate ligand 54486 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 54490 UDP-glucoronosyl and UDP-glucosyl transferase 54498 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 54498 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 54498 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 54498 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 54498 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 54504 Serine carboxypeptidase 54504 Serine carboxypeptidase 54510 Cadherin domain 54511 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 54513 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 54517 Uncharacterized protein family UPF0024 54519 PH domain. PH stands for pleckstrin homology 54519 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 54522 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 54524 C2 domain 54525 C2 domain 54526 C2 domain 54527 C2 domain 54531 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 54531 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL 54536 Exocyst complex subunit Sec15-like 54542 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 54549 Fibronectin type III domain 54550 Uncharacterized ACR, YneC family COG1359 54551 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 54552 GTPase of unknown function 54561 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 54563 Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins 54563 Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins 54566 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 54566 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 54567 Delta serrate ligand 54575 UDP-glucoronosyl and UDP-glucosyl transferase 54576 UDP-glucoronosyl and UDP-glucosyl transferase 54577 UDP-glucoronosyl and UDP-glucosyl transferase 54578 UDP-glucoronosyl and UDP-glucosyl transferase 54596 L1 transposable element 54598 7 transmembrane receptor (Secretin family) 54600 UDP-glucoronosyl and UDP-glucosyl transferase 54609 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 54610 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 54611 3'5'-cyclic nucleotide phosphodiesterase 54613 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 54616 Uncharacterized ACR, COG1579 54616 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 54617 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 54619 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi 54620 F-box domain 54622 ADP-ribosylation factor family 54623 Paf1. Members of this family are components of the RNA polymerase II associated Paf1 complex. The Paf1 complex functions during the elongation phase of transcription in conjunction with Spt4-Spt5 and Spt16-Pob3i 54624 Paf1. Members of this family are components of the RNA polymerase II associated Paf1 complex. The Paf1 complex functions during the elongation phase of transcription in conjunction with Spt4-Spt5 and Spt16-Pob3i 54624 Paf1. Members of this family are components of the RNA polymerase II associated Paf1 complex. The Paf1 complex functions during the elongation phase of transcription in conjunction with Spt4-Spt5 and Spt16-Pob3i 54625 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 54625 Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi 54626 Helix-loop-helix DNA-binding domain 54630 LIM domain. This family represents two copies of the LIM structural domain 54631 Fibronectin type III domain 54632 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl 54633 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 54644 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 54646 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 54651 Ubiquitin carboxyl-terminal hydrolase family 2 54651 Ubiquitin carboxyl-terminal hydrolases family 2 54652 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 54652 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 54657 UDP-glucoronosyl and UDP-glucosyl transferase 54658 UDP-glucoronosyl and UDP-glucosyl transferase 54659 UDP-glucoronosyl and UDP-glucosyl transferase 54673 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 54674 Fibronectin type III domain 54674 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 54677 Choline/Carnitine o-acyltransferase 54704 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 54705 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 54707 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 54710 Sulfotransferase protein 54712 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 54712 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 54714 Cyclic nucleotide-binding domain 54714 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 54716 Sodium:neurotransmitter symporter family 54716 Sodium:neurotransmitter symporter family 54718 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 54725 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 54726 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 54729 Homeobox domain 54732 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 54734 ADP-ribosylation factor family 54734 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 54737 'chromo' (CHRromatin Organization MOdifier) domain 54737 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 54738 Ets-domain 54739 TRAF-type zinc finger 54741 Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei 54751 LIM domain. This family represents two copies of the LIM structural domain 54751 LIM domain. This family represents two copies of the LIM structural domain 54752 Fibronectin type III domain 54760 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic 54763 Regulatory subunit of type II PKA R-subunit 54765 B-box zinc finger 54766 BTG1 family. A novel family of anti-proliferative proteins 54795 Transient receptor 54798 Cadherin domain 54799 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino te 54800 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 54800 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 54807 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 54812 Presenilin 54813 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 54814 Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally im 54822 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 54822 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 54825 Cadherin domain 54826 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic d 54831 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 54840 HIT family 54840 HIT family 54840 HIT family 54840 HIT family 54840 HIT family 54843 C2 domain 54843 C2 domain 54850 F-box domain 54852 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 54857 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 54860 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 54865 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 54865 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 54865 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 54869 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 54869 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 54873 Paralemmin 54877 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol 54878 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 54879 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 54879 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 54879 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 54879 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 54886 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 54896 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How 54898 GNS1/SUR4 family 54899 PX domain. PX domains bind to phosphoinositides 54904 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 54904 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 54904 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 54905 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 54906 O-methyltransferase 54910 Sema domain 54910 Plexin repeat 54915 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 54916 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 54920 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou 54922 DIL domain. The DIL domain has no known function 54931 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 54932 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi 54932 Protein of unknown function DUF82. This prokaryotic protein family has no known function. The protein contains four conserved cysteines that may be involved in metal binding or disulphide bridges 54933 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 54936 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi 54937 Helix-loop-helix DNA-binding domain 54946 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha 54948 Ribosomal protein L16 54949 Domain of unknown function (DUF339). This family of small proteins are uncharacterised 54952 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 54957 Mitosis protein DIM1 54961 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 54961 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 54977 Mitochondrial carrier protein 54993 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 54993 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 54993 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 54996 MOSC domain. The MOSC (MOCO sulfurase C-terminal) domain is a superfamily of beta-strand-rich domains identified in the molybdenum cofactor sulfurase and several other proteins from both prokaryotes and eukaryotes. These MOSC domains contain an absolutel 55032 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 55034 MOSC N-terminal beta barrel domain. This domain is found to the N-terminus of pfam03473. The function of this domain is unknown, however it is predicted to adopt a beta barrel fold 55036 Ezrin/radixin/moesin family 55061 Sushi domain (SCR repeat) 55063 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 55066 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 55074 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 55089 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 55095 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 55103 PH domain. PH stands for pleckstrin homology 55105 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 55108 BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function 55109 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 55110 Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the 55111 PH domain. PH stands for pleckstrin homology 55117 Sodium:neurotransmitter symporter family 55117 Sodium:neurotransmitter symporter family 55119 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou 55124 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 55124 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 55128 B-box zinc finger 55130 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 55130 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 55148 Putative zinc finger in N-recognin 55154 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacteria 55156 Armadillo/beta-catenin-like repeat 55161 Uncharacterised protein family (UPF0121). Uncharacterised integral membrane protein family 55175 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 55175 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 55186 Mitochondrial carrier protein 55187 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 55190 NUDIX domain 55193 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 55193 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 55198 PH domain. PH stands for pleckstrin homology 55203 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 55203 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 55207 ADP-ribosylation factor family 55208 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 55210 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 55217 Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1 55218 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi 55219 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 55224 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 55229 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 55230 Ubiquitin carboxyl-terminal hydrolase family 2 55231 Microtubule associated protein (MAP65/ASE1 family) 55233 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 55236 Repeat in ubiquitin-activating (UBA) protein 55236 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 55237 Transposase. Transposase proteins are necessary for efficient DNA transposition. This family consists of various E. coli insertion elements and other bacterial transposases some of which are members of the IS3 family 55238 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 55245 Ubiquinol-cytochrome C chaperone 55251 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 55253 Flavodoxin 55256 ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be 55274 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 55274 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 55275 Vps53-like, N-terminal domain. Vps53 complexes with Vps52 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 55284 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 55289 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 55291 Dehydrogenase E1 component 55293 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also membe 55295 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 55295 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 55300 Phosphatidylinositol 3- and 4-kinase 55303 GTPase of unknown function 55312 Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-pho 55315 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 55327 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 55334 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 55341 GTPase of unknown function 55342 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 55344 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 55349 GMC oxidoreductase. This family of proteins bind FAD as a cofactor 55350 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 55350 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1 55353 Golgi 4-transmembrane spanning transporter 55356 Sugar (and other) transporter 55362 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 55364 Uncharacterized protein family UPF0029 55366 7 transmembrane receptor (rhodopsin family) 55367 Death domain 55367 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 55367 Death domain 55367 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 55367 Death domain 55367 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 55370 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 55422 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 55486 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 55500 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 55501 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 55502 Helix-loop-helix DNA-binding domain 55503 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 55503 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 55505 Nucleolar RNA-binding protein, Nop10p family. Nop10p is a nucleolar protein that is specifically associated with H/ACA snoRNAs. It is essential for normal 18S rRNA production and rRNA pseudouridylation by the ribonucleoprotein particles containing H/ACA 55506 Core histone H2A/H2B/H3/H4 55506 Appr-1"-p processing enzyme family 55507 7 transmembrane receptor (metabotropic glutamate family) 55512 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 55515 Amiloride-sensitive sodium channel 55515 Amiloride-sensitive sodium channel 55521 B-box zinc finger 55530 Sugar (and other) transporter 55553 HMG (high mobility group) box 55558 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 55558 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 55567 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 55575 MSP (Major sperm protein) domain 55582 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 55584 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 55584 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 55585 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 55593 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 55593 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 55605 Kinesin motor domain 55605 WD domain, G-beta repeat 55605 Intermediate filament protein 55605 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 55605 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 55605 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 55614 Kinesin motor domain 55614 Kinesin motor domain 55614 PX domain. PX domains bind to phosphoinositides 55616 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 55621 N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The 55623 pfam02926, THUMP, THUMP domain. The THUMP domain is named after after thiouridine synthases, methylases and PSUSs. The THUMP domain consists of about 110 amino acid residues. It is predicted that this domain is an RNA-binding domain that adopts an alpha/ 55624 GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o 55630 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 55630 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 55636 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 55636 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 55636 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 55643 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 55647 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 55650 Protein of unknown function (DUF409). Family of eukaryotic membrane proteins with unknown function 55651 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 55659 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 55662 jmjC domain 55663 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 55664 Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90 55667 Sulfotransferase proteins 55679 LIM domain. This family represents two copies of the LIM structural domain 55680 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 55680 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 55681 Protein kinase domain 55684 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 55687 tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs 55691 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 55692 Protein of unknown function, DUF259 55693 jmjC domain 55698 DIL domain. The DIL domain has no known function 55698 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 55709 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 55709 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 55711 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 55722 Intermediate filament proteins 55727 BTB/POZ domain. The BTB (for BR-C 55728 Smr domain. This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 protein in the case of Smr proteins and with the N-terminal MutS relat 55733 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 55737 Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vp 55738 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 55738 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 55741 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 55742 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 55743 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 55745 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 55746 Nup133 nucleoporin. RNA undergoing nuclear export first encounters the basket of the nuclear pore. Nup133 is a nucleoporin accessible on the basket side of the pore 55749 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 55751 Domain of unknown function 55752 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 55754 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in resp 55757 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 55758 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 55762 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 55768 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 55770 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 55775 Alpha amylase 55783 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl 55786 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 55790 Galactosyltransferase 55794 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 55799 Cache domain 55801 Interleukin 10 55803 PH domain. PH stands for pleckstrin homology 55803 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 55806 jmjC domain 55806 jmjC domain 55808 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 55818 jmjC domain 55819 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 55821 Allantoicase repeat. This family is found in pairs in Allantoicases, forming the majority of the protein. These proteins allow the use of purines as secondary nitrogen sources in nitrogen-limiting conditions through the reaction: allantoate + H(2)0 = (-) 55823 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 55825 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 55830 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 55830 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 55832 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 55833 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 55833 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 55833 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 55846 FG-GAP repeat 55849 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain conta 55867 Sugar (and other) transporter 55869 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 55870 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 55870 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 55870 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 55871 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 55872 Protein kinase domain 55879 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 55879 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 55885 LIM domain. This family represents two copies of the LIM structural domain 55888 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 55890 7 transmembrane receptor (metabotropic glutamate family) 55890 7 transmembrane receptor (metabotropic glutamate family) 55892 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 55897 Helix-loop-helix DNA-binding domain 55904 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 55904 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 55907 Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <=> diphosphate + CMP-3-deoxy-D-manno-oc 55927 Helix-loop-helix DNA-binding domain 55930 DIL domain. The DIL domain has no known function 55930 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 55930 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 55932 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 55932 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 55938 Lipocalin / cytosolic fatty-acid binding protein family 55939 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 55946 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 55948 14-3-3 protein 55951 Uncharacterised protein family (UPF0041) 55954 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 55958 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 55959 Sulfatase 55959 Sulfatase 55961 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 55963 Sodium:dicarboxylate symporter family 55964 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 55964 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 55968 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 55968 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 55968 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 55972 Mitochondrial carrier protein 55974 MtN3/saliva family. This family includes proteins such as drosophila saliva, MtN3 involved in root nodule development and a protein involved in activation and expression of recombination activation genes (RAGs). Although the molecular function of these p 55975 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 55975 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 55979 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 55980 Inositol monophosphatase family 55981 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 55982 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 55983 TRAF-type zinc finger 55987 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 55988 PX domain. PX domains bind to phosphoinositides 55990 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 55992 B-box zinc finger 55992 Filamin/ABP280 repeat 55992 Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine 55992 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in at least one member, proteolysis has shown that the Peptid 55992 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph 55994 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 55994 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 56000 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 56001 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 56001 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl 56001 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 56001 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl 56003 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 56010 14-3-3 protein 56011 14-3-3 protein 56014 7 transmembrane receptor (rhodopsin family) 56015 7 transmembrane receptor (rhodopsin family) 56017 Sugar (and other) transporter 56031 Cyclophilin type peptidyl-prolyl cis-trans isomerase 56032 Uncharacterised protein family (UPF0171) 56033 Homeobox domain 56036 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 56036 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 56043 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 56043 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 56045 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 56046 Ubiquinol-cytochrome C chaperone 56050 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 56052 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 56057 BTG1 family. A novel family of anti-proliferative proteins 56062 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 56062 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 56066 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 56068 Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function 56070 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 56077 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 56077 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 56078 Eukaryotic-type carbonic anhydrase 56078 Eukaryotic-type carbonic anhydrase 56083 Transglutaminase family 56083 Transglutaminase family, C-terminal ig like domain 56083 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 56084 VHS domain. Domain present in VPS-27, Hrs and STAM 56085 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 56086 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 56096 Zona pellucida-like domain 56109 Cadherin domain 56109 Cadherin domain 56110 Cadherin domain 56110 Cadherin domain 56111 Cadherin domain 56111 Cadherin domain 56112 Cadherin domain 56112 Cadherin domain 56113 Cadherin domain 56113 Cadherin domain 56129 Cadherin domain 56137 Cadherin domain 56137 Cadherin domain 56139 Cadherin domain 56142 Cadherin domain 56144 Cadherin domain 56144 Cadherin domain 56147 Cadherin domain 56150 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 56160 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 56163 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 56164 Protein kinase domain 56164 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 56164 Protein kinase domain 56164 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 56165 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 56171 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 56173 PMP-22/EMP/MP20/Claudin family 56175 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 56180 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 56183 Neuromedin U 56184 Helix-loop-helix DNA-binding domain 56185 FMN-dependent dehydrogenase 56188 ATP1G1/PLM/MAT8 family 56189 Proline dehydrogenase 56191 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 56191 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 56193 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 56195 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 56196 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 56198 Helix-loop-helix DNA-binding domain 56203 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 56207 Ubiquitin carboxyl-terminal hydrolase, family 1 56208 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either asso 56209 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 56210 impB/mucB/samB family. These proteins are involved in UV protection 56210 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 56214 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 56215 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 56216 Syntaxin 56218 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 56218 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 56219 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 56220 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 56224 Tetraspanin family 56228 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 56248 Adenylate kinase 56252 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 56257 O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine 56258 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 56261 Starch binding domain 56261 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 56264 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 56265 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 56266 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 56275 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 56284 Ribosomal protein L19 56287 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 56288 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 56289 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 56293 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 56294 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 56294 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 56296 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 56297 ADP-ribosylation factor family 56299 TPR Domain 56299 TPR Domain 56299 TPR Domain 56302 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 56305 Phosphatidylinositol transfer protein 56307 metallopeptidase family M24 56314 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 56315 Ammonium Transporter Family 56318 Histidine acid phosphatase 56320 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 56322 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim 56324 VHS domain. Domain present in VPS-27, Hrs and STAM 56327 ADP-ribosylation factor family 56330 Protein of unknown function DUF122. This protein family has no known function 56334 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 56336 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 56342 Second step splicing factor 1. SSF is thought to bind rRNA and similarity with peter pan in Drosophila suggests a role in cell proliferation 56350 ADP-ribosylation factor family 56358 Clathrin adaptor complex small chain 56360 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 56362 Sulfotransferase protein 56365 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 56365 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate de 56369 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 56373 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 56375 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 56376 LIM domain. This family represents two copies of the LIM structural domain 56378 P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function 56379 Inward rectifier potassium channel 56380 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 56382 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 56386 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 56388 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 56388 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 56389 Syntaxin 56401 V-type ATPase 116kDa subunit family 56405 Protein-tyrosine phosphatase 56405 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 56413 7 transmembrane receptor (rhodopsin family) 56417 Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r 56421 Phosphofructokinase 56424 TPR Domain 56428 Mitochondrial carrier protein 56430 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 56431 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 56434 Tetraspanin family 56440 PX domain. PX domains bind to phosphoinositides 56440 Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2 56441 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 56442 TMS membrane protein/tumour differentially expressed protein (TDE) 56447 Clathrin adaptor complex small chain 56448 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 56453 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 56455 Dynein light chain type 1 56457 tRNA pseudouridine synthase 56459 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 56462 Mitochondrial carrier protein 56463 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 56469 pfam02891, zf-MIZ, MIZ zinc finger 56469 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 56470 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 56471 Stathmin family 56473 Fatty acid desaturase 56473 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 56479 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 56479 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 56485 Sugar (and other) transporter 56486 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 56490 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 56490 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 56491 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 56492 PMP-22/EMP/MP20/Claudin family 56495 Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell 56496 Tetraspanin family 56508 Cyclic nucleotide-binding domain 56508 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 56517 Sugar (and other) transporter 56521 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 56524 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 56525 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 56526 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 56529 Signal peptidase I 56531 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 56532 Protein kinase domain 56533 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 56534 Hsp20/alpha crystallin family 56543 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 56543 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 56544 7 transmembrane receptor (metabotropic glutamate family) 56544 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 56546 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 56547 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 56551 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 56552 7 transmembrane receptor (metabotropic glutamate family) 56552 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 56554 Class I Histocompatibility antigen, domains alpha 1 and 2 56603 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 56605 Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER 56606 Sugar (and other) transporter 56611 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 56612 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 56615 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 56619 Lectin C-type domain. This family includes both long and short form C-type 56622 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 56622 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 56626 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 56628 Class I Histocompatibility antigen, domains alpha 1 and 2 56629 Deoxyribonuclease II 56635 Somatotropin hormone family 56640 Trypsin 56643 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 56646 Galactoside-binding lectin 56648 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 56650 PMP-22/EMP/MP20/Claudin family 56656 7 transmembrane receptor (rhodopsin family) 56659 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 56666 Innexin. This family includes the drosophila proteins Ogre and shaking-B, and the C. elegans proteins Unc-7 and Unc-9. Members of this family are integral membrane proteins which are involved in the formation of gap junctions. This family has been named 56667 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 56670 7 transmembrane receptor (rhodopsin family) 56676 Helix-loop-helix DNA-binding domain 56695 Metallo-beta-lactamase superfamily 56696 7 transmembrane receptor (rhodopsin family) 56702 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 56704 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 56708 Ciliary neurotrophic factor 56715 Vacuolar sorting protein 9 (VPS9) domain 56717 Phosphatidylinositol 3- and 4-kinase 56717 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 56717 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 56717 Phosphatidylinositol 3- and 4-kinase 56717 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 56717 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 56720 Tryptophan 2,3-dioxygenase 56720 Tryptophan 2,3-dioxygenase 56734 Tub family 56735 Intermediate filament protein 56736 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 56737 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 56738 Radical SAM superfamily 56738 moaA / nifB / pqqE family 56739 Uncharacterized ACR, COG1354 56741 Fibronectin type III domain 56741 Fibronectin type III domain 56741 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 56741 Fibronectin type III domain 56741 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 56744 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 56745 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 56745 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 56747 CUB domain 56747 Sushi domain (SCR repeat) 56749 Dihydroorotate dehydrogenase 56751 Homeobox domain 56752 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 56753 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 56761 Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved 56766 Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei 56773 Sulfotransferase protein 56774 Sodium:neurotransmitter symporter family 56777 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 56777 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 56780 Histidine acid phosphatase 56783 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 56784 Rap/ran-GAP 56785 Rap/ran-GAP 56787 Helix-loop-helix DNA-binding domain 56791 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 56795 ADP-ribosylation factor family 56805 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 56806 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 56806 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 56807 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 56808 Cache domain 56809 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 56813 7 transmembrane receptor (Secretin family) 56814 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 56816 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 56816 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 56817 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 56819 Uncharacterized ACR, COG1579 56819 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo 56821 7 transmembrane receptor (rhodopsin family) 56825 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 56826 PH domain. PH stands for pleckstrin homology 56826 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 56827 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 56832 Interferon alpha/beta domain 56839 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 56839 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 56843 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 56843 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 56843 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 56847 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 56856 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 56858 7 transmembrane receptor (rhodopsin family) 56859 7 transmembrane receptor (rhodopsin family) 56860 7 transmembrane receptor (rhodopsin family) 56861 7 transmembrane receptor (rhodopsin family) 56863 PMP-22/EMP/MP20/Claudin family 56874 IQ calmodulin-binding motif. Calmodulin-binding motif 56889 Endomembrane protein 70 56891 Galactoside-binding lectin 56893 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 56898 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 56899 Phosphotyrosine interaction domain (PTB/PID) 56899 Phosphotyrosine interaction domain (PTB/PID) 56911 ATP synthase B/B' CF(0) 56913 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 56917 Homeobox domain 56918 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog 56923 7 transmembrane receptor (rhodopsin family) 56924 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 56928 Pox virus Ag35 surface protein 56928 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 56928 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 56934 Eukaryotic-type carbonic anhydrase 56938 Helix-loop-helix DNA-binding domain 56946 ENT domain. This presumed domain is named after Emsy N Terminus (ENT). Emsy is a protein that is amplified in breast cancer and interacts with BRCA2. The N terminus of this protein is found to be similar to other vertebrate and plant proteins of unknown 56950 MYND finger 56954 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 56956 Homeobox domain 56956 LIM domain. This family represents two copies of the LIM structural domain 56956 Homeobox domain 56956 LIM domain. This family represents two copies of the LIM structural domain 56959 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 56959 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 56959 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 56961 SH2 domain 56986 DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after 56987 HMG (high mobility group) box 56995 Tub family 56999 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 56999 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 56999 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 56999 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 56999 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 56999 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 57006 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 57007 7 transmembrane receptor (rhodopsin family) 57007 7 transmembrane receptor (rhodopsin family) 57014 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 57014 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 57016 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 57020 D-Ala-D-Ala carboxypeptidase 3 (S13) family 57021 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 57021 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 57023 Disintegrin 57023 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 57023 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 57025 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 57025 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 57027 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 57029 Disintegrin 57029 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 57037 MYND finger 57047 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 57048 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 57050 Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown 57053 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 57053 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 57057 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 57060 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 57080 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 57080 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 57088 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 57089 GDA1/CD39 (nucleoside phosphatase) family 57094 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 57104 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 57104 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of 57105 7 transmembrane receptor (rhodopsin family) 57113 Transient receptor 57116 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 57117 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 57119 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structur 57120 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 57121 7 transmembrane receptor (rhodopsin family) 57122 Nuclear pore protein 84 / 107. Nup84p forms a complex with five proteins, of which Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis 57127 Ammonium Transporter Family 57132 Eukaryotic protein of unknown function, DUF279 57134 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 57136 Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine 57139 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 57139 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 57142 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 57142 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 57142 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 57147 Protein kinase domain 57152 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 57154 C2 domain 57154 C2 domain 57154 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 57154 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 57156 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 57165 Connexin 57167 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 57170 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 57171 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 57178 pfam02891, zf-MIZ, MIZ zinc finger 57181 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 57182 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 57186 Rap/ran-GAP 57188 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 57191 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 57209 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 57211 7 transmembrane receptor (Secretin family) 57211 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 57218 Phage maturation protein 57231 PX domain. PX domains bind to phosphoinositides 57231 PX domain. PX domains bind to phosphoinositides 57232 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 57233 LIM domain. This family represents two copies of the LIM structural domain 57248 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 57249 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 57249 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 57250 7 transmembrane receptor (rhodopsin family) 57251 7 transmembrane receptor (rhodopsin family) 57252 7 transmembrane receptor (rhodopsin family) 57255 PMP-22/EMP/MP20/Claudin family 57257 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 57257 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 57259 BTG1 family. A novel family of anti-proliferative proteins 57260 7 transmembrane receptor (rhodopsin family) 57260 7 transmembrane receptor (rhodopsin family) 57265 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 57266 Small cytokines (intecrine/chemokine) 57267 Phosphotyrosine interaction domain (PTB/PID) 57267 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 57269 7 transmembrane receptor (rhodopsin family) 57269 7 transmembrane receptor (rhodopsin family) 57270 7 transmembrane receptor (rhodopsin family) 57270 7 transmembrane receptor (rhodopsin family) 57271 7 transmembrane receptor (rhodopsin family) 57271 7 transmembrane receptor (rhodopsin family) 57272 7 transmembrane receptor (rhodopsin family) 57274 Monocarboxylate transporter 57277 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 57279 Mitochondrial carrier protein 57282 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 57294 Ribosomal protein S27 57295 Isoprenylcysteine carboxyl methyltransferase (ICMT) family. The isoprenylcysteine o-methyltransferase (EC:2.1.1.100) family carry out carboyxl methylation of cleaved eukaryotic proteins that terminate in a CaaX motif. In Saccharomyces cerevisiae this meth 57298 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 57301 7 transmembrane receptor (rhodopsin family) 57302 7 transmembrane receptor (rhodopsin family) 57305 7 transmembrane receptor (rhodopsin family) 57315 WD domain, G-beta repeat 57315 WD domain, G-beta repeat 57320 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 57321 Phosphoenolpyruvate carboxykinase 57325 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 57325 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 57332 'chromo' (CHRromatin Organization MOdifier) domain 57335 Zinc finger 57335 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 57337 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 57338 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 57339 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 57340 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 57341 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 57342 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 57349 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 57357 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 57369 Connexin 57370 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 57376 HMG (high mobility group) box 57377 Mannosyl oligosaccharide glucosidase. This is a family of eukaryotic enzymes belonging to glycosyl hydrolase family 63. They catalyse the specific cleavage of the non-reducing terminal glucose residue from Glc(3)Man(9)GlcNAc(2). Mannosyl oligosaccharide 57385 7 transmembrane receptor (rhodopsin family) 57402 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 57403 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 57404 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 57404 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 57409 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 57414 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 57418 WD domain, G-beta repeat 57419 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 57419 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells 57426 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 57429 Sulfotransferase protein 57430 Sulfotransferase protein 57436 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 57442 Slow voltage-gated potassium channel 57443 F-box domain 57446 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 57446 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 57447 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 57448 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 57448 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 57449 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 57451 Plant PEC family metallothionein 57451 Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int 57452 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 57453 Fibronectin type III domain 57460 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 57466 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 57467 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 57474 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 57475 Intermediate filament protein 57475 PH domain. PH stands for pleckstrin homology 57475 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 57477 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 57478 Ubiquitin carboxyl-terminal hydrolase family 2 57480 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 57484 Zinc finger, C3HC4 type (RING finger) 57488 C2 domain 57492 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 57492 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 57497 Fibronectin type III domain 57502 Carboxylesterase 57502 Carboxylesterase 57504 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 57504 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL 57504 Early E1A protein. This is a family of adenovirus early E1A proteins. The E1A protein is 32 kDa it can however be cleaved to yield the 28 kDa protein. The E1A protein is responsible for the transcriptional activation of the early genes with in the viral g 57504 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do 57505 tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases 57512 7 transmembrane receptor (metabotropic glutamate family) 57512 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 57512 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 57512 Dolichyl-phosphate-mannose-protein mannosyltransferase. This is a family of Dolichyl-phosphate-mannose-protein mannosyltransferase proteins EC:2.4.1.109. These proteins are responsible for O-linked glycosylation of proteins, they catalyse the reaction:- D 57513 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 57514 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 57515 TMS membrane protein/tumour differentially expressed protein (TDE) 57516 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 57519 START domain 57520 C2 domain 57520 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 57520 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 57522 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 57522 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 57522 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 57524 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 57526 Cadherin domain 57528 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 57529 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 57530 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 57531 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 57531 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 57534 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 57534 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 57536 Integral membrane protein DUF106. This archaebacterial protein family has no known function. Members are predicted to be integral membrane proteins 57537 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 57538 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 57540 AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance 57540 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 57542 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 57542 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 57544 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 57546 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 57549 Fibronectin type III domain 57551 Protein kinase domain 57551 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 57552 Carboxylesterase 57553 LIM domain. This family represents two copies of the LIM structural domain 57553 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 57555 Carboxylesterase 57555 Carboxylesterase 57558 Ubiquitin carboxyl-terminal hydrolases family 2 57559 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 57563 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 57565 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 57565 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 57568 Rap/ran-GAP 57568 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 57569 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 57570 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 57573 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 57576 Kinesin motor domain 57586 C2 domain 57591 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 57593 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 57604 ubiE/COQ5 methyltransferase family 57605 Phosphatidylinositol transfer protein 57605 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester 57609 AMP-binding enzyme 57610 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 57611 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 57612 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 57612 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 57621 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 57622 Fibronectin type III domain 57622 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 57622 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 57626 BTB/POZ domain. The BTB (for BR-C 57626 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Kelch is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Sev 57628 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 57629 Doublecortin 57630 Zinc finger, C3HC4 type (RING finger) 57630 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 57634 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 57636 PH domain. PH stands for pleckstrin homology 57636 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 57642 Fibronectin type III domain 57642 von Willebrand factor type A domain 57642 Thrombospondin N-terminal -like domain 57645 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 57646 Ubiquitin carboxyl-terminal hydrolase family 2 57649 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 57649 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 57655 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 57657 Cyclic nucleotide-binding domain 57657 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 57659 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 57659 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 57661 Zinc finger, C3HC4 type (RING finger) 57661 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 57663 Ubiquitin carboxyl-terminal hydrolase family 2 57665 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 57669 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 57671 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 57671 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 57674 Zinc finger, C3HC4 type (RING finger) 57677 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 57679 Vacuolar sorting protein 9 (VPS9) domain 57680 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 57680 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 57684 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 57684 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 57685 Cache domain 57685 Cache domain 57689 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 57690 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 57690 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 57690 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 57692 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 57695 Ubiquitin carboxyl-terminal hydrolase family 2 57695 Ubiquitin carboxyl-terminal hydrolases family 2 57696 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 57697 ERCC4 domain. This domain is predicted to be a nuclease domain 57697 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 57699 C2 domain 57703 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 57703 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 57705 Beige/BEACH domain 57705 WD domain, G-beta repeat 57706 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 57706 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 57706 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 57708 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 57709 Amino acid permease 57713 Sterile alpha motif (SAM)/Pointed domain 57713 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 57713 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 57722 Fibronectin type III domain 57727 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 57729 Protein kinase domain 57729 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 57731 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 57732 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 57733 Glycosyl hydrolase family 1 57737 Homeobox domain 57738 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 57741 Uncharacterised protein family (UPF0120) 57746 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 57746 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 57749 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 57749 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 57756 LIM domain. This family represents two copies of the LIM structural domain 57764 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 57775 Ubiquitin carboxyl-terminal hydrolase family 2 57775 Ubiquitin carboxyl-terminal hydrolase family 2 57775 Ubiquitin carboxyl-terminal hydrolases family 2 57780 ATP1G1/PLM/MAT8 family 57782 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 57794 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 57801 Helix-loop-helix DNA-binding domain 57804 DNA polymerase delta, subunit 4 57808 Ribosomal protein L35Ae 57809 Ribosomal protein L35Ae 57810 Fibronectin type III domain 57811 7 transmembrane receptor (rhodopsin family) 57813 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 57815 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 57815 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 57815 Parvovirus non-structural protein NS1. This family also contains the NS2 protein. Parvoviruses encode two non-structural proteins, NS1 and NS2. The mRNA for NS2 contains the coding sequence for the first 87 amino acids of NS1, then by an alternative splic 57819 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 57829 Zona pellucida-like domain 57834 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 57835 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 57835 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 57835 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 57835 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 57876 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 57908 Type IV secretion system CagX conjugation protein 57913 Death domain 58155 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 58157 Globin 58158 Helix-loop-helix DNA-binding domain 58168 7 transmembrane receptor (rhodopsin family) 58170 Amiloride-sensitive sodium channel 58175 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 58176 Ammonium Transporter Family 58177 Mitochondrial carrier protein 58178 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 58180 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 58180 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 58182 7 transmembrane receptor (rhodopsin family) 58184 Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes 58185 Radical SAM superfamily 58187 PMP-22/EMP/MP20/Claudin family 58187 PMP-22/EMP/MP20/Claudin family 58190 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 58190 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 58193 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 58198 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 58198 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 58206 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 58208 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 58222 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 58223 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 58226 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 58226 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 58227 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 58227 Rifin/stevor family. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmem 58230 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 58240 PX domain. PX domains bind to phosphoinositides 58242 NUDIX domain 58242 NUDIX domain 58243 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 58250 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 58475 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 58477 ADP-ribosylation factor family 58484 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 58485 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 58489 Dienelactone hydrolase family 58492 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 58492 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 58497 DHHA2 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA2 for DHH associated domain. This domain is diagnostic of DHH subfamily 2 members. The domain is about 120 residues long and contains a conserved DXK motif at 58500 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 58508 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 58508 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 58508 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 58508 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 58510 Proline dehydrogenase 58511 Deoxyribonuclease II 58511 Deoxyribonuclease II 58512 Guanylate-kinase-associated protein (GKAP) protein 58517 PWI domain 58517 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 58520 Uncharacterised protein family (UPF0143). This family of uncharacterised proteins are integral membrane proteins. They may contain 4 transmembrane helices. The family contains a conserved arginine and histidine that may be functionally important 58522 B-box zinc finger 58524 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 58533 PX domain. PX domains bind to phosphoinositides 58538 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 58800 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 58800 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 58802 Calcium-activated potassium channel, beta subunit 58802 Calcium-activated potassium channel, beta subunit 58803 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 58805 Helix-loop-helix DNA-binding domain 58807 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 58807 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 58808 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 58810 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 58813 pfam02872, 5_nucleotidaseC, 5'-nucleotidase, C-terminal domain 58813 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 58814 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 58817 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 58820 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 58822 Protein kinase domain 58845 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 58851 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 58851 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 58852 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 58852 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 58853 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 58853 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 58854 Proteasome A-type and B-type 58857 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 58860 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 58860 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 58861 7 transmembrane receptor (rhodopsin family) 58866 Trehalase 58867 Synaptogyrin. This family of proteins is distantly related to pfam01284 58868 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 58868 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 58868 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 58911 Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown 58918 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 58919 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 58920 Glypican 58922 Homeobox domain 58923 B-box zinc finger 58923 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 58924 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 58927 Ribosomal protein L36e 58934 MYND finger 58934 Programmed cell death protein 2, C-terminal domain 58935 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox 58936 Protein kinase domain 58936 POLO box duplicated region 58937 Somatotropin hormone family 58945 Dynein light chain type 1 58946 Fibronectin type III domain 58947 Trypsin 58949 7 transmembrane receptor (rhodopsin family) 58952 Peptidase family M28D 58953 Sulfotransferase protein 58958 Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins 58959 7 transmembrane receptor (Secretin family) 58961 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 58962 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 58963 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 58963 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 58964 Nucleoside diphosphate kinase 58968 Extracellular link domain 58970 Fatty acid desaturase 58971 ATP1G1/PLM/MAT8 family 58974 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 58978 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 58978 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 58980 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 58980 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 58982 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 58985 Tissue factor 58992 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 58992 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 58994 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 59004 pfam02891, zf-MIZ, MIZ zinc finger 59007 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 59009 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 59012 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 59012 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 59013 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 59025 Ubiquitin carboxyl-terminal hydrolase family 2 59028 RNA 3'-terminal phosphate cyclase 59030 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 59031 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 59033 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 59044 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 59049 Sugar (and other) transporter 59053 Domain of unknown function (DUF383). domo keywords :chromosome c26f1.12c 41.3 59053 Domain of unknown function (DUF384). domo keywords :chromosome c26f1.12c 41.3 59058 Helix-loop-helix DNA-binding domain 59069 Tropomyosin 59075 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 59076 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 59077 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 59082 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 59083 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 59084 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 59086 Transforming growth factor beta like domain 59086 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 59087 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 59087 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl 59088 Somatotropin hormone family 59095 ATP1G1/PLM/MAT8 family 59103 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 59106 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 59107 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 59108 Globin 59113 Monooxygenase. This family includes diverse enzymes that utilise FAD 59116 Acyl CoA binding protein 59117 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 59117 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 59264 7 transmembrane receptor (rhodopsin family) 59265 PH domain. PH stands for pleckstrin homology 59266 Cyclic nucleotide-binding domain 59266 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 59268 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 59269 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 59271 Galactose binding lectin domain 59272 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 59277 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 59283 PMP-22/EMP/MP20/Claudin family 59284 PMP-22/EMP/MP20/Claudin family 59286 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 59289 7 transmembrane receptor (rhodopsin family) 59293 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 59296 Deoxyribonuclease II 59300 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 59303 Uncharacterised protein family (UPF0121). Uncharacterised integral membrane protein family 59305 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 59307 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 59312 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 59317 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 59317 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 59323 Furin-like cysteine rich region 59323 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 59324 TASK K+ channel 59326 Cyclic nucleotide-binding domain 59326 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 59327 Helix-loop-helix DNA-binding domain 59328 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 59328 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 59340 7 transmembrane receptor (rhodopsin family) 59341 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 59341 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 59341 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 59341 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 59342 Serine carboxypeptidase 59344 Lipoxygenase 59344 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 59347 Translationally controlled tumor protein 59349 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 59350 7 transmembrane receptor (rhodopsin family) 60312 PH domain. PH stands for pleckstrin homology 60314 Uncharacterised protein family (UPF0160). This family of proteins contains a large number of metal binding residues. The patterns are suggestive of a phosphoesterase function. The conserved DHH motif may mean this family is related to pfam01368 60315 Uncharacterised protein family (UPF0160). This family of proteins contains a large number of metal binding residues. The patterns are suggestive of a phosphoesterase function. The conserved DHH motif may mean this family is related to pfam01368 60326 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 60326 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 60327 Transglutaminase family 60327 Transglutaminase family, C-terminal ig like domain 60327 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 60328 Kinase associated domain 1 60329 Helix-loop-helix DNA-binding domain 60331 Josephin 60335 Transglutaminase family 60335 Transglutaminase family, C-terminal ig like domain 60335 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 60338 ATP1G1/PLM/MAT8 family 60349 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 60349 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 60351 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 60351 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 60353 Peridinin-chlorophyll A binding protein 60355 Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o 60356 Pyridoxal-dependent decarboxylase conserved domain 60357 Dolichyl-phosphate-mannose-protein mannosyltransferase 60361 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 60363 PMP-22/EMP/MP20/Claudin family 60367 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 60371 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 60372 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 60374 Intermediate filament protein 60374 Intermediate filament tail domain 60377 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 60379 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 60381 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 60383 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 60384 Coatomer WD associated domain 60386 Mitochondrial carrier protein 60390 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 60393 Sulfotransferase protein 60394 Helix-loop-helix DNA-binding domain 60395 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 60398 HIT family 60399 7 transmembrane receptor (rhodopsin family) 60409 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 60412 Sec8 exocyst complex component specific domain 60414 Ammonium Transporter Family 60418 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 60423 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin 60425 C2 domain 60427 Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site i 60430 Apoptosis regulator proteins, Bcl-2 family 60431 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 60432 7 transmembrane receptor (Secretin family) 60432 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 60433 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 60434 Apoptosis regulator proteins, Bcl-2 family 60437 Cadherin domain 60441 Phosphatidylethanolamine-binding protein 60442 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 60444 Calcium-activated BK potassium channel alpha subunit 60444 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 60446 Homeobox domain 60448 7 transmembrane receptor (rhodopsin family) 60450 SH2 domain 60451 7 transmembrane receptor (rhodopsin family) 60460 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 60463 7 transmembrane receptor (rhodopsin family) 60464 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 60465 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 60468 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 60481 GNS1/SUR4 family 60484 Extracellular link domain 60485 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 60495 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular 60510 C2 domain 60527 Fatty acid desaturase 60527 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 60527 Fatty acid desaturase 60529 OAR domain 60529 Homeobox domain 60560 Mak10 subunit, NatC N(alpha)-terminal acetyltransferase. NatC N(alpha)-terminal acetyltransferases contains Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex 60563 jmjC domain 60564 C2 domain 60565 C2 domain 60566 C2 domain 60567 C2 domain 60568 C2 domain 60575 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 60576 Trehalase 60577 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 60580 HIT family 60581 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 60584 LIF / OSM family 60586 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 60587 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 60589 Protein kinase domain 60589 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 60590 7 transmembrane receptor (metabotropic glutamate family) 60590 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 60591 Calcium-activated potassium channel, beta subunit 60595 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 60597 Phosphotyrosine interaction domain (PTB/PID) 60598 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 60613 KCNQ1 voltage-gated potassium channel 60613 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 60613 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 60628 7 transmembrane receptor (rhodopsin family) 60661 Helix-loop-helix DNA-binding domain 60662 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 60663 Furin-like cysteine rich region 60663 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 60664 SH2 domain 60665 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 60666 NAD-dependent glycerol-3-phosphate dehydrogenase 60667 7 transmembrane receptor (rhodopsin family) 60668 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 60669 7 transmembrane receptor (rhodopsin family) 60670 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 60671 FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD bi 60681 FKBP-type peptidyl-prolyl cis-trans isomerase 60682 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 60685 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i 63827 Extracellular link domain 63827 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 63836 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 63839 LIM domain. This family represents two copies of the LIM structural domain 63843 7 transmembrane receptor (rhodopsin family) 63847 ATP1G1/PLM/MAT8 family 63848 ATP1G1/PLM/MAT8 family 63849 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 63849 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 63849 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 63852 Phosphotriesterase family 63857 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc 63859 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 63864 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 63865 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this 63866 Amiloride-sensitive sodium channel 63867 UDP-glucoronosyl and UDP-glucosyl transferase 63873 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 63873 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 63878 Stanniocalcin family 63882 Amiloride-sensitive sodium channel 63883 Slow voltage-gated potassium channel 63885 3'5'-cyclic nucleotide phosphodiesterase 63889 7 transmembrane receptor (rhodopsin family) 63897 Adaptin N terminal region. This family consists of the N terminal region of various alpha 63904 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 63917 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 63923 Fibronectin type III domain 63923 Fibrinogen beta and gamma chains, C-terminal globular domain 63931 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 63935 Hexon-associated protein (IIIa) 63938 3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin 63938 Prephenate dehydrogenase. Members of this family are prephenate dehydrogenases EC:1.3.1.12 involved in tyrosine biosynthesis 63938 NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold 63938 TrkA-N domain. This domain is found in a wide variety of proteins. These protein include potassium channels, phosphoesterases, and various other transporters. This domain binds to NAD 63941 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ 63943 TPR Domain 63950 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 63951 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 63951 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 63953 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 63954 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 63954 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 63954 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 63959 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 63973 Helix-loop-helix DNA-binding domain 63973 Helix-loop-helix DNA-binding domain 63974 Helix-loop-helix DNA-binding domain 63976 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 63985 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 63986 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 63995 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 63997 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 64001 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 64005 Myosin head (motor domain) 64009 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 64010 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 64013 Phosducin 64017 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 64017 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 64020 7 transmembrane receptor (rhodopsin family) 64021 7 transmembrane receptor (rhodopsin family) 64023 CUB domain 64028 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 64032 Thrombospondin type 1 domain 64035 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 64036 Sushi domain (SCR repeat) 64036 Sushi domain (SCR repeat) 64037 Sugar (and other) transporter 64040 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 64042 7 transmembrane receptor (rhodopsin family) 64043 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 64044 Death effector domain 64045 Glutaredoxin 64046 Mer11 DNA-binding domain. The Mre11 complex is a multisubunit nuclease that is composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved in checkpoint signalling and DNA replication. Mre11 has an intrinsic DNA-binding activity that is stimulated by Rad50 o 64050 YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity 64051 Sugar (and other) transporter 64055 Guanylin precursor 64057 Extracellular link domain 64057 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 64060 Sec1 family 64061 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 64066 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 64066 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 64066 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 64072 Cadherin domain 64072 Cadherin domain 64074 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 64075 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 64076 Sulfate transporter family. Mutations may lead to several human diseases 64078 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin 64081 Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal 64086 Protein kinase domain 64088 PX domain. PX domains bind to phosphoinositides 64089 PX domain. PX domains bind to phosphoinositides 64089 PX domain. PX domains bind to phosphoinositides 64089 PX domain. PX domains bind to phosphoinositides 64093 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 64094 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 64095 7 transmembrane receptor (rhodopsin family) 64096 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 64096 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 64096 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 64097 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 64098 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 64099 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 64102 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 64103 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 64104 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 64106 7 transmembrane receptor (rhodopsin family) 64107 7 transmembrane receptor (rhodopsin family) 64110 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 64114 Uncharacterized protein family UPF0005 64116 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 64118 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou 64119 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 64120 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 64124 7 transmembrane receptor (Secretin family) 64129 Papain family cysteine protease 64130 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 64131 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 64132 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 64133 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 64134 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 64138 Papain family cysteine protease 64144 YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity 64145 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 64147 Kinesin motor domain 64147 Kinesin motor domain 64147 Kinesin motor domain 64151 Chromosome condensation protein 3, C-terminal region. Cnd3 is a Member of the five subunit condensin complex. Each subunit is essential for mitotic condensation 64155 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 64156 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 64156 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 64157 Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzyme 64159 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 64159 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 64167 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 64174 Renal dipeptidase 64176 Sugar (and other) transporter 64177 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 64180 Renal dipeptidase 64185 CAP protein 64187 ADP-ribosylation factor family 64188 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 64189 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu 64190 ATP1G1/PLM/MAT8 family 64191 Ergosterol biosynthesis ERG4/ERG24 family 64195 Hepatic lectin, N-terminal domain 64195 Lectin C-type domain. This family includes both long and short form C-type 64198 Insulinase (Peptidase family M16) 64201 Mitochondrial carrier protein 64202 Calreticulin family 64203 Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci 64204 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 64204 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 64205 Ribosomal protein L10 64206 Tryptophan 2,3-dioxygenase 64211 Homeobox domain 64211 LIM domain. This family represents two copies of the LIM structural domain 64213 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 64218 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 64225 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 64231 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 64231 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 64231 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 64232 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 64236 LIM domain. This family represents two copies of the LIM structural domain 64242 Globin 64282 PAP/25A associated domain 64284 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 64285 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 64290 Fork head domain 64291 Oxysterol-binding protein 64292 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 64293 Protein kinase domain 64294 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 64297 7 transmembrane receptor (metabotropic glutamate family) 64298 Ribosomal protein L31e 64300 Formate--tetrahydrofolate ligase 64302 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 64305 Sulfotransferase protein 64306 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 64307 Ribosomal protein L24e 64314 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 64315 Tetraspanin family 64316 HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc 64317 Glutathione peroxidase 64321 HMG (high mobility group) box 64324 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 64324 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 64324 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 64324 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 64324 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 64324 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 64333 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 64340 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 64340 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 64342 PX domain. PX domains bind to phosphoinositides 64342 PX domain. PX domains bind to phosphoinositides 64343 Intermediate filament protein 64347 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 64348 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 64349 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 64350 von Willebrand factor type A domain 64352 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 64353 LIM domain. This family represents two copies of the LIM structural domain 64354 7 transmembrane receptor (rhodopsin family) 64360 Ribosomal protein L23 64361 Somatotropin hormone family 64362 Intermediate filament protein 64363 Protein kinase domain 64363 Raf-like Ras-binding domain 64363 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 64365 Eukaryotic cobalamin-binding protein 64367 Cyclophilin type peptidyl-prolyl cis-trans isomerase 64368 Somatotropin hormone family 64369 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 64370 metallopeptidase family M24 64372 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 64374 Eukaryotic aspartyl protease 64377 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 64378 7 transmembrane receptor (rhodopsin family) 64380 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 64381 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 64382 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 64385 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 64386 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 64386 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 64386 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 64386 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 64386 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 64386 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 64388 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 64392 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 64395 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 64396 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 64403 Cadherin domain 64403 Cadherin domain 64403 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 64403 Cadherin domain 64403 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 64405 Cadherin domain 64405 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 64409 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 64410 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 64411 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 64412 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 64418 Glucose inhibited division protein A 64420 Sushi domain (SCR repeat) 64422 Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 64422 Autophagocytosis associated protein, N-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 64424 Uncharacterized BCR, YceG family COG1559 64428 pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain 64434 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 64434 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 64440 C2 domain 64441 HSF-type DNA-binding 64442 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 64443 7 transmembrane receptor (rhodopsin family) 64444 LIM domain. This family represents two copies of the LIM structural domain 64445 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 64450 7 transmembrane receptor (rhodopsin family) 64451 AMP-binding enzyme 64451 AMP-binding enzyme 64452 Sodium:solute symporter family 64452 Sodium:solute symporter family 64452 Sodium:solute symporter family 64454 Sodium:solute symporter family 64454 Sodium:solute symporter family 64454 Sodium:solute symporter family 64455 ADP-ribosylation factor family 64455 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 64456 Reeler domain 64456 Thrombospondin type 1 domain 64457 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 64459 Trypsin 64459 CUB domain 64459 Sushi domain (SCR repeat) 64461 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 64462 Protein kinase domain 64463 SURF1 family 64464 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 64467 GDA1/CD39 (nucleoside phosphatase) family 64468 Hyaluronidase 64468 Hyaluronidase 64470 CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p 64472 Mak10 subunit, NatC N(alpha)-terminal acetyltransferase. NatC N(alpha)-terminal acetyltransferases contains Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex 64473 Inositol monophosphatase family 64473 Inositol monophosphatase family 64478 CUB domain 64478 Sushi domain (SCR repeat) 64478 Sushi domain (SCR repeat) 64508 Adenylate and Guanylate cyclase catalytic domain 64512 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 64513 V-type ATPase 116kDa subunit family 64514 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 64516 Sec1 family 64518 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 64519 GDA1/CD39 (nucleoside phosphatase) family 64520 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 64521 Tetraspanin family 64522 Sodium:solute symporter family 64529 Papain family cysteine protease 64532 Adenylate and Guanylate cyclase catalytic domain 64534 Protein kinase domain 64537 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular 64540 Tetraspanin family 64543 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 64543 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 64544 Glycosyltransferase family 43 64551 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 64553 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 64555 Adenylate kinase 64557 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 64558 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 64560 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 64563 Fibronectin type III domain 64564 Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1 64566 wnt family 64567 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 64567 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 64567 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 64570 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 64571 Homeobox domain 64572 Homeobox domain 64573 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 64574 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 64574 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 64576 Fibronectin type III domain 64577 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 64577 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 64579 Sulfotransferase protein 64580 Sulfotransferase protein 64582 7 transmembrane receptor (rhodopsin family) 64582 7 transmembrane receptor (rhodopsin family) 64591 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 64598 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 64619 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 64620 Golgi 4-transmembrane spanning transporter 64621 Alkaline phosphatase 64623 Prenylated rab acceptor (PRA1) 64624 Cullin family 64626 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 64632 Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in S. cerevisiae , Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In S. cerevisiae, the exocys 64633 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 64634 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 64635 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 64636 7 transmembrane receptor (rhodopsin family) 64637 Helix-loop-helix DNA-binding domain 64638 Ribosomal L28e protein family 64641 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 64652 PX domain. PX domains bind to phosphoinositides 64657 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 64659 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 64664 ADP-ribosylation factor family 64665 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 64668 Lectin C-type domain. This family includes both long and short form C-type 64670 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 64672 Phospholamban. The regulation of calcium levels across the membrane of the sarcoplasmic reticulum involves the interplay of many membrane proteins. Phospholamban is a 52 residue integral membrane protein that is involved in reversibly inhibiting the Ca(2 64677 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 64678 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 64678 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 64679 Transglutaminase family 64679 Transglutaminase family, C-terminal ig like domain 64679 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 64680 7 transmembrane receptor (Secretin family) 64681 Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein 64702 B-box zinc finger 64707 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 64714 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 64718 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 64744 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 64746 Acyl CoA binding protein 64748 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 64754 MYND finger 64772 Glycosyl hydrolase family 85. Family of endo-beta-N-acetylglucosaminidases. These enzymes work on a broad spectrum of substrates 64778 Fibronectin type III domain 64780 LIM domain. This family represents two copies of the LIM structural domain 64780 FAD binding domain. This domain is involved in FAD binding in a number of enzymes 64780 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 64794 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 64801 Arv1-like family. Arv1 is a transmembrane protein with potential zinc-binding motifs. ARV1 is a novel mediator of eukaryotic sterol homeostasis 64803 7 transmembrane receptor (rhodopsin family) 64805 7 transmembrane receptor (rhodopsin family) 64805 7 transmembrane receptor (rhodopsin family) 64816 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 64816 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 64816 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 64817 Sushi domain (SCR repeat) 64817 von Willebrand factor type A domain 64817 Pentaxin family. Pentaxins are also known as pentraxins 64817 HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc 64822 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 64823 Protein kinase domain 64828 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 64829 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 64830 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 64833 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 64834 GNS1/SUR4 family 64840 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 64841 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 64843 LIM domain. This family represents two copies of the LIM structural domain 64846 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 64848 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 64849 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 64855 PH domain 64855 Acetaldehyde dehydrogenase 64856 Fibronectin type III domain 64857 PH domain. PH stands for pleckstrin homology 64857 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 64862 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 64865 Cadherin domain 64881 Cadherin domain 64918 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 64919 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 64919 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 64922 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 64926 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 64928 Ribosomal protein L14p/L23e 64931 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 64931 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 64933 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 64940 Stromal antigen (SA/STAG) protein 64960 Ribosomal protein S15 64965 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 64978 Phosphatidylethanolamine-binding protein 65008 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 65009 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 65009 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 65010 Sulfate transporter family. Mutations may lead to several human diseases 65010 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 65010 Sulfate transporter family. Mutations may lead to several human diseases 65010 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 65010 Sulfate transporter family. Mutations may lead to several human diseases 65010 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 65012 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 65018 Protein kinase domain 65020 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 65020 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 65022 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 65024 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 65024 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 65029 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 65029 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 65029 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 65030 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 65031 C2 domain 65032 7 transmembrane receptor (rhodopsin family) 65035 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 65035 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 65036 Carboxylesterase 65037 Fibronectin type III domain 65039 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 65040 Guanylate-kinase-associated protein (GKAP) protein 65042 Tricarboxylate carrier 65043 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 65044 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 65045 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 65046 Clathrin adaptor complex small chain 65047 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 65048 Cadherin domain 65048 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 65050 Homeobox domain 65051 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 65052 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 65055 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 65081 Sec1 family 65082 Sec1 family 65083 Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap 65083 Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap 65083 Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap 65086 7 transmembrane receptor (rhodopsin family) 65094 jmjC domain 65096 Galactose binding lectin domain 65096 7 transmembrane receptor (Secretin family) 65096 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 65096 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou 65098 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i 65106 Prenylated rab acceptor (PRA1) 65109 Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3 65109 Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3 65110 Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3 65110 Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3 65114 Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vp 65116 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 65129 PMP-22/EMP/MP20/Claudin family 65130 PMP-22/EMP/MP20/Claudin family 65131 PMP-22/EMP/MP20/Claudin family 65132 PMP-22/EMP/MP20/Claudin family 65134 Syntaxin 65136 Ribosomal protein S8e 65139 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 65140 7 transmembrane receptor (rhodopsin family) 65142 ADP-ribosylation factor family 65144 Tuberin 65151 Endoribonuclease L-PSP. Endoribonuclease active on single-stranded mRNA. Inhibits protein synthesis by cleavage of mRNA. Previously thought to inhibit protein synthesis initiation. This protein may also be involved in the regulation of purine biosynthesi 65152 Phosphofructokinase 65154 Thrombospondin type 1 domain 65156 Dihydroorotate dehydrogenase 65160 Nucleotide-sensitive chloride conductance regulator (ICln) 65162 Iodothyronine deiodinase 65165 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 65166 Signal peptidase I 65168 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cy 65169 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cy 65170 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 65171 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 65172 Protein kinase domain 65172 LIM domain. This family represents two copies of the LIM structural domain 65172 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 65178 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 65180 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 65180 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 65183 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 65185 Sulfotransferase protein 65187 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 65187 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 65190 Radical SAM superfamily 65191 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 65191 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 65193 Homeobox domain 65194 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 65195 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 65195 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 65196 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 65197 Sugar (and other) transporter 65198 Cyclic nucleotide-binding domain 65198 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 65198 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 65200 Monocarboxylate transporter 65201 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 65202 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 65204 Calponin family repeat 65205 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 65205 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 65205 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 65206 Calcium-activated SK potassium channel 65206 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 65207 Ammonium Transporter Family 65209 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 65210 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 65217 Cadherin domain 65218 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 65220 ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus 65221 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 65241 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 65256 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 65256 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 65262 ATP synthase alpha/beta chain, C terminal domain 65262 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 65262 pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 65270 Glycosyltransferase family 6 65272 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 65275 7 transmembrane receptor (rhodopsin family) 65276 7 transmembrane receptor (rhodopsin family) 65278 Protein kinase domain 65279 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 65279 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 65793 Ribosomal protein S7e 65945 Cadherin domain 65961 Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown 65961 Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown 65969 CUB domain 65970 LIM domain. This family represents two copies of the LIM structural domain 65972 Gamma interferon inducible lysosomal thiol reductase (GILT). This family includes the two characterized human gamma-interferon-inducible lysosomal thiol reductase (GILT) sequences. It also contains several other eukaryotic putative proteins with similari 65986 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 65987 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 65988 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 65989 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 65992 PCI domain 66002 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 66011 Importin-beta N-terminal domain 66012 Adenylate and Guanylate cyclase catalytic domain 66014 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 66016 Calcium-activated potassium channel, beta subunit 66017 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 66019 Homeobox domain 66021 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 66022 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 66022 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 66023 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro 66024 7 transmembrane receptor (rhodopsin family) 66025 7 transmembrane receptor (rhodopsin family) 66026 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 66026 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 66027 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 66028 Prenylated rab acceptor (PRA1) 66029 Indoleamine 2,3-dioxygenase 66030 Synaptophysin / synaptoporin 66035 Sugar (and other) transporter 66052 Succinate dehydrogenase cytochrome b subunit 66053 Cyclophilin type peptidyl-prolyl cis-trans isomerase 66056 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 66061 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 66072 Domain of unknown function (DUF339). This family of small proteins are uncharacterised 66084 Uncharacterized ACR, YagE family COG1723 66090 Yippee putative zinc-binding protein 66090 Yippee putative zinc-binding protein 66094 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 66101 Cyclophilin type peptidyl-prolyl cis-trans isomerase 66109 Tetraspanin family 66111 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 66112 MOSC N-terminal beta barrel domain. This domain is found to the N-terminus of pfam03473. The function of this domain is unknown, however it is predicted to adopt a beta barrel fold 66112 MOSC domain. The MOSC (MOCO sulfurase C-terminal) domain is a superfamily of beta-strand-rich domains identified in the molybdenum cofactor sulfurase and several other proteins from both prokaryotes and eukaryotes. These MOSC domains contain an absolutely 66113 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 66116 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 66118 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 66120 FKBP-type peptidyl-prolyl cis-trans isomerase 66124 Josephin 66144 ATP synthase (F/14-kDa) subunit. This family includes 14-kDa subunit from vATPases, which is in the peripheral catalytic part of the complex. The family also includes archaebacterial ATP synthase subunit F 66148 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 66153 F-box domain 66154 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 66166 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 66168 Uncharacterized protein family UPF0005 66177 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 66181 Nucleolar RNA-binding protein, Nop10p family. Nop10p is a nucleolar protein that is specifically associated with H/ACA snoRNAs. It is essential for normal 18S rRNA production and rRNA pseudouridylation by the ribonucleoprotein particles containing H/ACA 66182 ADP-ribosylation factor family 66192 Peptidase family M3 66194 Delta 1-pyrroline-5-carboxylate reductase 66197 Cyclin-dependent kinase regulatory subunit 66198 Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins 66204 Acylphosphatase 66208 Steroid binding domain 66211 Ribosomal protein L3 66212 Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG 66222 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 66234 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. 66235 Eukaryotic initiation factor 1A 66237 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 66248 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 66251 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 66269 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 66283 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 66284 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 66286 Signal peptidase I 66290 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 66307 Isochorismatase family. This family are hydrolase enzymes 66313 C2 domain 66313 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 66313 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 66314 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction 66322 Intermediate filament protein 66328 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 66332 Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene 66333 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 66333 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 66335 V-ATPase subunit C 66354 SKIP/SNW domain. This domain is found in chromatin proteins 66355 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 66368 RNA 3'-terminal phosphate cyclase 66369 Uncharacterized protein family UPF0034 66369 Histidine biosynthesis protein. Proteins involved in steps 4 and 6 of the histidine biosynthesis pathway are contained in this family. Histidine is formed by several complex and distinct biochemical reactions catalysed by eight enzymes. The enzymes in thi 66369 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 66369 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour 66373 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 66373 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 66383 NifU-like N terminal domain. This domain is found in NifU in combination with pfam01106. This domain is found on isolated in several bacterial species. The nif genes are responsible for nitrogen fixation. However this domain is found in bacteria that do 66384 SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. Th 66387 NUDIX domain 66392 Somatotropin hormone family 66399 Elongation factor TS 66401 NUDIX domain 66406 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 66407 Ribosomal protein S15 66408 HIT family 66409 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 66410 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 66411 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 66420 RNA polymerase Rpb5, C-terminal domain. The assembly domain of Rpb5. The archaeal equivalent to this domain is subunit H. Subunit H lacks the N-terminal domain 66420 RNA polymerase Rpb5, N-terminal domain. Rpb5 has a bipartite structure which includes a eukaryote-specific N-terminal domain and a C-terminal domain resembling the archaeal RNAP subunit H. The N-terminal domain is involved in DNA binding and is part of th 66427 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 66435 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 66441 Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the 66445 Cytochrome C1 family 66446 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 66446 3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 66447 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 66449 Uncharacterised protein family (UPF0108) 66454 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase, Glycerol-3-phosphate cytidylyltransferase 66454 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase 66454 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase, Glycerol-3-phosphate cytidylyltransferase 66454 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase 66459 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 66461 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 66464 Transcription initiation factor TFIID subunit A 66480 Ribosomal L15 66481 Ribosomal protein S21e 66482 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 66483 Ribosomal protein L44 66505 MYND finger 66505 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 66513 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 66513 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 66515 Cullin family 66541 Signal peptidase I 66549 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 66552 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 66552 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 66569 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 66576 Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formati 66586 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 66588 Adenylate kinase 66589 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 66590 tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain 66592 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 66592 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 66597 B-box zinc finger 66607 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 66612 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 66614 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 66615 Peptidase family C54 66616 PX domain. PX domains bind to phosphoinositides 66622 Putative zinc finger in N-recognin 66647 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 66659 Histidine acid phosphatase 66663 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 66667 jmjC domain 66673 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 66676 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 66682 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 66686 LCCL domain 66689 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 66691 Vacuolar sorting protein 9 (VPS9) domain 66696 PX domain. PX domains bind to phosphoinositides 66698 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 66700 Eukaryotic protein of unknown function, DUF279 66701 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 66702 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 66704 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol 66711 Uncharacterized protein family UPF0023 66720 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 66733 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 66733 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 66734 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 66745 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction 66747 F-actin capping protein alpha subunit 66767 Fructose-bisphosphate aldolase class-I 66786 7 transmembrane receptor (rhodopsin family) 66795 Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 66799 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 66805 Tetraspanin family 66808 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 66809 Intermediate filament protein 66810 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 66821 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 66824 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 66830 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 66844 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 66848 Alpha-L-fucosidase 66853 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 66854 B-box zinc finger 66854 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 66860 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 66861 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 66863 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 66863 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 66863 Recombination activating protein 2. V-D-J recombination is the combinatorial process by which the huge range of immunoglobulin and T cell binding specificity is generated from a limited amount of genetic material. This process is synergistically activate 66866 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in at least one member, proteolysis has shown that the Peptid 66866 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph 66867 HMG (high mobility group) box 66871 C2 domain 66873 Fibronectin type III domain 66873 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 66877 HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins 66889 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 66890 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19- 66892 Eukaryotic initiation factor 4E 66901 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 66902 Phosphorylase family 2 66904 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 66905 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 66913 ER lumen protein retaining receptor 66914 VPS28 protein 66921 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou 66921 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about 66926 Eukaryotic initiation factor 3, gamma subunit. eIF-3 is a multisubunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3 66938 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 66943 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How 66945 Fumarate reductase/succinate dehydrogenase flavoprotein C-terminal domain. This family contains fumarate reductases, succinate dehydrogenases and L-aspartate oxidases 66945 pfam02910, succ_DH_flav_C, Fumarate reductase/succinate dehydrogenase flavoprotein C-terminal domain. This family contains fumarate reductases, succinate dehydrogenases and L-aspartate oxidases 66945 FAD binding domain. This family includes members that bind FAD. This family includes the flavoprotein subunits from succinate and fumarate dehydrogenase, aspartate oxidase and the alpha subunit of adenylylsulfate reductase 66957 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 66958 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 66959 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 66964 Got1-like family. Traffic through the yeast Golgi complex depends on a member of the syntaxin family of SNARE proteins, Sed5, present in early Golgi cisternae. Got1 is thought to facilitate Sed5-dependent fusion events 66968 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 66977 Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein h 66978 Protein of unknown function, DUF259 66978 Protein of unknown function, DUF259 66993 BAF60b domain of the SWIB complex 67000 Somatotropin hormone family 67016 PH domain. PH stands for pleckstrin homology 67016 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 67028 Eukaryotic protein of unknown function, DUF279 67038 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 67042 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 67043 BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function 67052 HEC/Ndc80p family. Members of this family are components of the mitotic spindle. It has been shown that Ndc80/HEC from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle 67053 Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40 67054 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 67054 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 67064 Eukaryotic protein of unknown function, DUF279 67068 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 67071 Protein kinase domain 67071 Protein kinase C terminal domain 67072 Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90 67073 Phosphatidylinositol 3- and 4-kinase 67075 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 67097 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 67106 Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues 67111 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 67115 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 67117 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 67123 UBA/TS-N domain 67131 Acyl CoA binding protein 67133 Zona pellucida-like domain 67136 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 67160 Elongation factor 1 gamma, conserved domain 67161 DNA gyrase/topoisomerase IV, subunit A 67161 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 67161 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 67166 ADP-ribosylation factor family 67168 7 transmembrane receptor (rhodopsin family) 67169 Death domain 67178 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 67181 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 67187 MYND finger 67199 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 67201 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 67204 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 67205 Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This 67224 V-type ATPase 116kDa subunit family 67231 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 67239 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 67252 CAP protein 67260 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 67281 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 67283 Mitochondrial carrier protein 67285 Cyclophilin type peptidyl-prolyl cis-trans isomerase 67286 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 67287 Integral membrane protein DUF110. This archaebacterial protein family has no known function. Some members of this family are annotated as FlaJ, however we can find no supporting evidence for this annotation 67287 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 67300 Coatomer WD associated domain 67300 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 67300 Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each leg 67302 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 67307 Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal 67310 Somatotropin hormone family 67332 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 67338 FYVE zinc finger 67344 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 67366 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 67369 Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally im 67370 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 67373 Trypsin 67373 Trypsin 67384 BAG domain. Domain present in Hsp70 regulators 67390 SpoU rRNA Methylase family. This family of proteins probably use S-AdoMet 67393 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 67398 Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of 67402 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 67412 Uncharacterized ACR, COG1579 67414 GTPase of unknown function 67420 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 67427 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 67429 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 67429 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 67441 Isochorismatase family. This family are hydrolase enzymes 67443 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 67444 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 67446 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 67452 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 67453 Mitochondrial carrier protein 67455 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 67459 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 67459 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 67464 GDA1/CD39 (nucleoside phosphatase) family 67466 Phosducin 67471 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 67484 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 67489 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 67492 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 67492 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets in 67498 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 67498 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 67498 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 67498 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 67500 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 67505 Somatotropin hormone family 67511 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 67512 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 67516 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 67525 Zinc finger, C3HC4 type (RING finger) 67525 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 67526 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 67527 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 67527 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 67527 Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus 67528 NUDIX domain 67528 NUDIX domain 67530 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly i 67533 Uncharacterized ACR, COG1579 67533 Intermediate filament protein 67533 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 67533 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 67533 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 67543 Poly-adenylate binding protein, unique domain 67547 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 67553 ubiE/COQ5 methyltransferase family 67553 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 67553 Glucose inhibited division protein. This is a family of bacterial Glucose inhibited division proteins these are probably involved in the regulation of cell devision 67553 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 67554 Mitochondrial carrier protein 67561 WD domain, G-beta repeat 67562 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 67563 pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain 67573 Lysyl oxidase 67574 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain conta 67582 Mitochondrial carrier protein 67586 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 67591 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 67597 Protein kinase C terminal domain 67603 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 67604 Protein of unknown function (DUF410). Family of conserved eukaryotic proteins with undetermined function 67607 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 67608 pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain 67630 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 67634 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 67666 Extracellular link domain 67666 Extracellular link domain 67666 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 67667 ubiE/COQ5 methyltransferase family 67671 Ribosomal L38e protein family 67674 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 67674 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 67675 CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide var 67681 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 67684 Protein of unknown function, DUF259 67689 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 67702 Zinc finger, C3HC4 type (RING finger) 67702 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 67703 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 67703 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 67712 Mitochondrial carrier protein 67712 Mitochondrial carrier protein 67713 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 67717 ab-hydrolase associated lipase region 67722 Actin 67728 Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein s 67728 Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein sy 67728 Putative diphthamide synthesis protein.One member is a candidate tumour suppressor gene. DPH2 from yeast1, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein sy 67738 Cyclophilin type peptidyl-prolyl cis-trans isomerase 67760 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 67760 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 67768 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 67772 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 67784 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 67790 ADP-ribosylation factor family 67790 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 67795 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 67800 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 67803 LIM domain. This family represents two copies of the LIM structural domain 67804 PX domain. PX domains bind to phosphoinositides 67804 Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2 67815 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 67815 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 67821 Sodium / potassium ATPase beta chain 67830 Rer1 family. RER1 family protein are involved in involved in the retrieval of some endoplasmic reticulum membrane proteins from the early golgi compartment. The C terminus of yeast Rer1p interacts with a coatomer complex 67834 Isocitrate/isopropylmalate dehydrogenase 67841 Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 67841 Autophagocytosis associated protein, N-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 67843 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 67848 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 67848 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 67854 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 67854 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 67855 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 67860 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 67861 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 67863 Mitochondrial carrier protein 67878 Uncharacterised protein family (UPF0121). Uncharacterised integral membrane protein family 67880 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 67890 Uncharacterised protein family (UPF0185). This family contains a number of small uncharacterised proteins including BM-002 67891 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 67891 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 67892 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 67895 Inorganic pyrophosphatase 67897 mRNA capping enzyme, large subunit 67909 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 67931 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 67933 Fibronectin type III domain 67941 Ribosomal protein S27 67942 ATP synthase subunit C 67952 MAS20 protein import receptor 67958 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 67958 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 67963 ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r 67972 E1-E2 ATPase 67972 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 67972 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 67972 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 67973 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerev 67974 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 67980 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 67981 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 67988 Calsequestrin 67988 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 67991 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 68009 Defensin propeptide 68011 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 68028 Ribosomal L22e protein family 68039 Bombesin-like peptide 68047 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 68052 Ribosomal protein S15 68053 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 68054 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 68059 Endomembrane protein 70 68066 Mitochondrial carrier protein 68079 Programmed cell death protein 2, C-terminal domain 68080 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 68082 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 68087 Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction 68090 Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1 68097 Dynein light chain type 1 68107 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 68133 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl grou 68134 Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3 68137 ER lumen protein retaining receptor 68140 Tc5 transposase 68140 CENP-B protein. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding domain, which binds to a corresponding 17bp CENP-B box sequence. In the C-terminal 59 residues, 68140 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 68146 ADP-ribosylation factor family 68153 TFIIE beta subunit core domain. General transcription factor TFIIE consists of two subunits, TFIIE alpha pfam02002 and TFIIE beta. TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription 68159 Syntaxin 68178 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 68192 Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei 68193 Ribosomal protein L24e 68195 Ribonuclease T2 family 68212 Uncharacterized protein family UPF0005 68219 MutT-like domain 68239 Intermediate filament protein 68267 Mitochondrial carrier protein 68273 GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o 68273 GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o 68275 Replication factor-A protein 1, N-terminal domain 68278 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 68278 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 68279 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 68280 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 68291 Glucose inhibited division protein A 68292 Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase 68299 Vps53-like, N-terminal domain. Vps53 complexes with Vps52 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 68304 Filamin/ABP280 repeat 68311 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 68312 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 68312 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 68337 LIM domain. This family represents two copies of the LIM structural domain 68338 Got1-like family. Traffic through the yeast Golgi complex depends on a member of the syntaxin family of SNARE proteins, Sed5, present in early Golgi cisternae. Got1 is thought to facilitate Sed5-dependent fusion events 68346 Sir2 family 68346 NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide 68348 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 68349 Respiratory-chain NADH dehydrogenase, 30 Kd subunit 68352 Domain of unknown function DUF108. This family has no known function. It is found to compose the complete protein in archaebacteria and a single domain in a large C. elegans protein 68360 Electron transfer flavoprotein beta subunit. This protein is distantly related to and forms a heterodimer with pfam00766 68371 Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal 68393 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 68396 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 68401 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 68423 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 68423 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 68427 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 68436 Ribosomal protein L34e 68441 ADP-ribosylation factor family 68441 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 68444 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 68453 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 68460 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 68463 Ribosomal protein L14p/L23e 68465 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 68473 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc 68479 Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs 68494 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 68497 Filamin/ABP280 repeat 68497 Filamin/ABP280 repeat 68497 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 68498 Tetraspanin family 68507 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 68509 Pentaxin family. Pentaxins are also known as pentraxins 68511 Doublecortin 68526 Auxin Efflux Carrier 68526 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 68553 von Willebrand factor type A domain 68553 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 68553 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 68572 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly con 68581 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 68585 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 68588 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 68591 MOSC N-terminal beta barrel domain. This domain is found to the N-terminus of pfam03473. The function of this domain is unknown, however it is predicted to adopt a beta barrel fold 68591 MOSC domain. The MOSC (MOCO sulfurase C-terminal) domain is a superfamily of beta-strand-rich domains identified in the molybdenum cofactor sulfurase and several other proteins from both prokaryotes and eukaryotes. These MOSC domains contain an absolutely 68598 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 68603 Phosphomevalonate kinase. Phosphomevalonate kinase (EC:2.7.4.2) catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This family represents the animal ty 68604 WD domain, G-beta repeat 68606 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 68607 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 68616 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 68621 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 68621 Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 68621 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 68628 WD domain, G-beta repeat 68646 ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus 68653 Bacterial surface antigen. This entry includes the following surface antigens 68666 Sugar (and other) transporter 68667 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 68667 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 68668 Trypsin 68678 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 68701 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 68701 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 68703 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 68705 Transcription initiation factor IIF, beta subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIF (TFIIF) is a tetramer of two beta subun 68723 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 68724 ADP-ribosylation factor family 68729 B-box zinc finger 68729 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 68730 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou 68731 Ribosome-binding factor A 68735 Ribosomal protein S18 68750 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 68753 Fibronectin type III domain 68760 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 68764 Cadherin domain 68774 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 68775 V-ATPase subunit C 68792 HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc 68794 Filamin/ABP280 repeat 68801 GNS1/SUR4 family 68802 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 68810 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 68815 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 68815 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 68828 Intermediate filament protein 68833 Phosducin 68833 Phosducin 68837 Fork head domain 68837 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 68839 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 68839 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 68842 Tub family 68860 Cache domain 68865 Arv1-like family. Arv1 is a transmembrane protein with potential zinc-binding motifs. ARV1 is a novel mediator of eukaryotic sterol homeostasis 68870 Shikimate kinase 68870 Adenylate kinase 68870 Papillomavirus helicase. This protein is a DNA helicase that is required for initiation of viral DNA replication. This protein forms a complex with the E2 protein pfam00508 68895 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 68897 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 68897 7 transmembrane receptor (metabotropic glutamate family) 68897 AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance 68897 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 68897 MMPL family. Members of this family are putative integral membrane proteins from bacteria. Several of the members are mycobacterial proteins. Many of the proteins contain two copies of this aligned region. The function of these proteins is not known, alth 68904 Domain of unknown function (DUF341) 68904 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 68904 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 68911 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 68915 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 68917 HIT family 68922 WD domain, G-beta repeat 68926 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 68929 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 68938 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 68938 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 68939 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 68942 Eukaryotic protein of unknown function, DUF279 68942 Eukaryotic protein of unknown function, DUF279 68943 Protein kinase domain 68943 Protein kinase domain 68947 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 68953 Eukaryotic protein of unknown function, DUF279 68961 Protein kinase domain 68966 Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown 68969 Translation initiation factor SUI1 68972 Metalloenzyme of unknown function DUF136. This family of archaeal proteins has no known function. However they appear to be related to a large superfamily of metalloenzymes 68972 TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme 68976 Alpha-L-fucosidase 68977 Metallo-beta-lactamase superfamily 68980 WD domain, G-beta repeat 68992 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 68999 Anaphase-promoting complex, subunit 10 (APC10) 69008 Mo25 protein family 69014 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 69020 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 69024 MIT domain 69024 PX domain. PX domains bind to phosphoinositides 69024 MIT domain 69024 PX domain. PX domains bind to phosphoinositides 69028 MIT domain 69032 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme 69036 Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein 69047 E1-E2 ATPase 69047 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 69047 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 69049 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 69051 Delta 1-pyrroline-5-carboxylate reductase 69052 SH2 domain 69060 Lipase 69070 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 69077 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 69083 Sulfotransferase protein 69089 60Kd inner membrane protein 69091 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp 69097 B-box zinc finger 69097 B-box zinc finger 69106 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 69106 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 69109 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 69113 2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2 69117 Zinc-binding dehydrogenase 69117 Prephenate dehydrogenase. Members of this family are prephenate dehydrogenases EC:1.3.1.12 involved in tyrosine biosynthesis 69121 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 69123 Acyl CoA binding protein 69144 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 69149 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 69149 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 69149 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 69149 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 69156 O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production 69156 ubiE/COQ5 methyltransferase family 69156 O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production 69168 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 69171 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 69178 PX domain. PX domains bind to phosphoinositides 69183 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 69185 DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after 69186 Protein of unknown function (DUF423). Potential integral membrane protein 69188 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 69188 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 69191 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 69207 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 69215 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 69219 Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzymes 69226 PX domain. PX domains bind to phosphoinositides 69228 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 69228 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 69228 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 69228 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 69239 PH domain. PH stands for pleckstrin homology 69239 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 69241 RNA polymerase Rpb4 69241 RNA polymerase Rpb4 69253 Hsp20/alpha crystallin family 69256 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 69256 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 69259 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 69259 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 69259 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 69270 Uncharacterised protein family (UPF0080) 69274 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 69276 Translocation protein Sec62 69277 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 69286 SCP-like extracellular protein 69286 SCP-like extracellular protein. This domain is also found in prokaryotes 69288 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 69288 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 69294 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 69294 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 69317 HMG (high mobility group) box 69335 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription 69352 Uncharacterized ACR, YneC family COG1359 69354 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 69362 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 69362 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 69372 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 69379 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 69382 Core histone H2A/H2B/H3/H4 69382 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 69389 Core histone H2A/H2B/H3/H4 69416 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 69416 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 69435 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist 69444 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 69459 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 69462 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 69511 Trypsin 69538 von Willebrand factor type A domain 69562 Protein kinase domain 69574 Dienelactone hydrolase family 69574 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 69583 TNF(Tumor Necrosis Factor) family 69597 Peptidase family M41 69597 Sigma-54 interaction domain 69597 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 69601 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 69602 Protein of unknown function, DUF270 69608 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 69632 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 69632 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 69635 Death domain 69656 Uncharacterized BCR, YhhW family COG1741 69660 Uncharacterized protein family UPF0005 69672 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 69674 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 69675 Animal haem peroxidase 69675 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 69675 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 69677 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 69687 Yippee putative zinc-binding protein 69693 Intermediate filament protein 69693 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 69713 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 69716 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 69716 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 69718 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 69719 Aspartate/ornithine carbamoyltransferase, Asp/Orn binding domain 69719 Aspartate/ornithine carbamoyltransferase, carbamoyl-P binding domain 69719 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 69719 D-ala D-ala ligase. This family contains D-alanine--D-alanine ligase enzymes EC:6.3.2.4 69719 Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosyn 69719 Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosy 69719 Carbamoyl-phosphate synthetase large chain, oligomerisation domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. The carbamoyl-phosphate synthase (CPS) enzyme in prokar 69719 MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also i 69721 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 69726 MYND finger 69727 Ubiquitin carboxyl-terminal hydrolase family 2 69727 Ubiquitin carboxyl-terminal hydrolases family 2 69737 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 69745 DNA polymerase delta, subunit 4 69761 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 69772 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 69774 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 69780 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 69787 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 69814 Trypsin 69826 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 69833 RNA polymerase Rpb6. Rpb6 is an essential subunit in the eukaryotic polymerases Pol I, II and III. The bacterial equivalent to Rpb6 is the omega subunit. Rpb6 and omega are structurally conserved and both function in polymerase assembly 69837 Zinc finger, C3HC4 type (RING finger) 69847 Protein kinase domain 69860 Eukaryotic initiation factor 1A 69864 Phospholipase A2 inhibitor 69865 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 69888 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 69902 Ribosomal protein L10 69903 DIL domain. The DIL domain has no known function 69906 Mitochondrial carrier protein 69908 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 69920 RNA polymerases M/15 Kd subunit 69923 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 69934 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 69955 tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain 69965 'Cold-shock' DNA-binding domain 69976 GHMP kinases putative ATP-binding protein 69981 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in resp 69993 SH2 domain 69993 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 69993 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 70008 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 70009 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members 70019 ENV polyprotein (coat polyprotein) 70025 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C 70026 TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermedia 70028 Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in 70044 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 70047 Poly A polymerase family. This family includes nucleic acid independent RNA polymerases, such as Poly(A) polymerase, which adds the poly (A) tail to mRNA EC:2.7.7.19. This family also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of 70052 WD domain, G-beta repeat 70059 Fatty acid desaturase 70061 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 70061 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 70065 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 70065 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 70065 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 70073 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 70086 7 transmembrane receptor (rhodopsin family) 70093 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 70097 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 70099 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 70099 RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated 70099 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 70101 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 70114 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 70120 tRNA synthetases class I (W and Y) 70122 YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity 70152 ubiE/COQ5 methyltransferase family 70160 Vacuolar protein sorting 36. Vps36 is involved in Golgi to endosome trafficking 70160 Vacuolar protein sorting 36. Vps36 is involved in Golgi to endosome trafficking 70163 Phospholipase A2 inhibitor 70166 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 70192 Lectin C-type domain. This family includes both long and short form C-type 70202 Papain family cysteine protease 70207 Domain of unknown function DUF28. This domain is found in bacterial and yeast proteins it compromises the entire length or central region of most of the proteins in the family, all of which are hypothetical with no known function. The average length of th 70218 Kinesin motor domain 70223 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 70223 OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata 70233 GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein 70235 WD domain, G-beta repeat 70310 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 70314 Rabaptin 70315 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 70335 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 70349 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 70355 7 transmembrane receptor (metabotropic glutamate family) 70361 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19- 70363 Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol 70364 Fibronectin type III domain 70369 BAG domain. Domain present in Hsp70 regulators 70370 Sushi domain (SCR repeat) 70370 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 70370 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 70380 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 70381 PH domain. PH stands for pleckstrin homology 70383 UbiA prenyltransferase family 70406 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps 70425 Protein kinase domain 70428 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase comp 70430 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 70432 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 70432 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 70435 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 70439 Zn-finger in Ran binding protein and others 70439 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 70450 C2 domain 70451 NADP oxidoreductase coenzyme F420-dependent 70451 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 70456 Uncharacterised protein family (UPF0041) 70472 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 70472 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 70478 Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but func 70479 PX domain. PX domains bind to phosphoinositides 70503 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 70508 HMG (high mobility group) box 70520 SCP-like extracellular protein 70520 SCP-like extracellular protein. This domain is also found in prokaryotes 70527 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 70530 Fibronectin type III domain 70533 NAC domain 70536 Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally imp 70546 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 70549 I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin 70550 Gelsolin repeat 70550 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 70551 TPR Domain 70556 Mitochondrial carrier protein 70559 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 70561 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 70571 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 70571 FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions 70574 Zinc carboxypeptidase 70584 Protein kinase domain 70584 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 70604 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 70611 F-box domain 70616 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 70620 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 70620 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 70620 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 70640 NUDIX domain 70646 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 70650 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol 70673 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 70673 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 70673 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 70676 Phosphotyrosine interaction domain (PTB/PID) 70676 Phosphotyrosine interaction domain (PTB/PID) 70686 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 70693 7 transmembrane receptor (Secretin family) 70693 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 70693 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 70696 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 70719 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 70719 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 70727 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 70729 Phosphotyrosine interaction domain (PTB/PID) 70737 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 70747 Tetraspanin family 70750 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 70767 PWI domain 70772 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 70772 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 70784 ADP-ribosylation factor family 70784 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 70785 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 70785 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 70785 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 70788 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 70788 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 70796 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 70796 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 70797 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 70797 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 70802 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 70804 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 70810 Intermediate filament protein 70823 HMG (high mobility group) box 70834 Intermediate filament protein 70839 7 transmembrane receptor (rhodopsin family) 70840 Sugar (and other) transporter 70843 Intermediate filament protein 70861 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 70866 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 70866 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 70873 Cyclic nucleotide-binding domain 70882 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 70892 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 70893 Glycosyl hydrolases family 35 70902 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 70902 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 70911 Fibronectin type III domain 70911 Fibronectin type III domain 70927 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 70928 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 70930 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 70945 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 70945 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 70965 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 70974 Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II 70974 Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I 70974 pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I 70974 pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II 70980 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 70980 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 70981 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 70981 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 70990 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 70999 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 71003 Trypsin 71027 Eukaryotic protein of unknown function, DUF284. Members of this family have no known function. They have been predicted to contain transmembrane helices 71027 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo 71037 Trypsin 71041 Zinc finger, C3HC4 type (RING finger) 71062 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 71063 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 71066 HSF-type DNA-binding 71066 HSF-type DNA-binding domain 71078 Disintegrin 71078 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 71078 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 71085 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 71089 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 71091 Protein kinase domain 71093 Helix-loop-helix DNA-binding domain 71103 Glycosyltransferase family 6 71131 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 71137 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 71147 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 71147 Beta-ketoacyl synthase, C-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains 71147 Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains. 71164 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 71176 F-box domain 71183 Lectin C-type domain. This family includes both long and short form C-type 71207 NUDIX domain 71228 Guanylate kinase 71268 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 71275 Delta 1-pyrroline-5-carboxylate reductase 71279 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 71287 Serine carboxypeptidase 71302 PH domain. PH stands for pleckstrin homology 71302 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 71302 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 71310 GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 71310 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 71310 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 71330 Regulator of chromosome condensation (RCC1) 71330 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 71351 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 71355 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 71365 Polyprenyl synthetase 71367 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 71375 Fork head domain 71382 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 71389 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 71393 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 71393 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 71395 CUB domain 71395 CUB domain 71395 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 71412 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 71435 PH domain. PH stands for pleckstrin homology 71435 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 71436 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 71436 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 71436 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 71452 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 71468 Homeobox domain 71472 MYND finger 71472 Ubiquitin carboxyl-terminal hydrolase family 2 71472 Ubiquitin carboxyl-terminal hydrolases family 2 71519 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 71520 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 71522 Gamma-glutamyltranspeptidase 71538 F-box domain 71540 PX domain. PX domains bind to phosphoinositides 71544 PH domain. PH stands for pleckstrin homology 71544 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 71567 MCM2/3/5 family 71584 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 71591 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 71592 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 71592 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 71597 Homeobox domain 71599 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 71599 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 71602 Myosin head (motor domain) 71602 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 71609 Death domain 71619 ADP-ribosylation factor family 71619 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 71619 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 71648 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 71665 Alpha-L-fucosidase 71678 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 71682 WD domain, G-beta repeat 71685 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 71693 Lectin C-type domain. This family includes both long and short form C-type 71699 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples hav 71709 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 71709 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 71710 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 71710 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 71720 Oxysterol-binding protein 71726 Uracil DNA glycosylase superfamily 71729 Raf-like Ras-binding domain 71729 Phosphotyrosine interaction domain (PTB/PID) 71729 Raf-like Ras-binding domain 71729 Phosphotyrosine interaction domain (PTB/PID) 71729 LGN motif, putative GEF specific for G-alpha GTPase 71729 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 71729 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 71732 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 71733 Sushi domain (SCR repeat) 71738 MAM domain. An extracellular domain found in many receptors 71738 MAM domain. An extracellular domain found in many receptors 71742 MIT domain 71742 Protein kinase domain 71743 Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction 71745 Cullin family 71746 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 71749 Zinc-binding dehydrogenase 71754 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 71756 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 71761 Chlorohydrolase. This family consist of chlorohydrolase from the ATZ/TRZ family 71761 Adenine deaminase. Adenine deaminase EC:3.5.4.2 hydrolyses adenine to form hypoxanthine and ammonia. The enzyme is part of a large metal dependent hydrolase superfamily. Adenine deaminases reaction is important for adenine utilization as a purine and also 71766 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 71766 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 71768 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 71770 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 71770 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 71773 UDP-glucoronosyl and UDP-glucosyl transferase 71775 Transferrin 71778 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 71778 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 71778 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 71780 Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play 71787 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 71790 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 71791 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 71791 Zinc carboxypeptidase 71791 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol 71793 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 71795 Phosphatidylinositol transfer protein 71801 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 71801 PH domain. PH stands for pleckstrin homology 71801 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 71803 Mitochondrial carrier protein 71803 Mitochondrial carrier protein 71805 Nucleoporin interacting componentent 71810 RanBP1 domain 71816 Zinc finger, C3HC4 type (RING finger) 71819 Kinesin motor domain 71820 WD domain, G-beta repeat 71820 WD domain, G-beta repeat 71822 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 71822 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 71822 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 71824 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 71827 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 71828 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep 71830 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 71832 Citrate synthase 71833 WD domain, G-beta repeat 71834 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 71840 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 71847 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 71853 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 71854 Renal dipeptidase 71865 F-box domain 71869 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 71879 AMP-binding enzyme 71881 Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine 71883 UbiA prenyltransferase family 71884 Glycosyl hydrolases family 18 71884 Oxygen-independent Coproporphyrinogen III oxidase. This family of bacterial enzymes catalyses the anaerobic transformation of coproporphyrinogen III to protoporphyrinogen IX required for porphyrin biosynthesis 71887 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 71888 Intermediate filament protein 71889 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 71890 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 71891 Cytidine and deoxycytidylate deaminase zinc-binding region 71893 PX domain. PX domains bind to phosphoinositides 71902 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 71903 Carboxylesterase 71904 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 71907 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 71908 PMP-22/EMP/MP20/Claudin family 71910 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 71911 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 71916 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou 71922 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 71924 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 71924 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 71932 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 71934 Eukaryotic-type carbonic anhydrase 71941 tRNA synthetases class I (K). This family includes only lysyl tRNA synthetases from prokaryotes 71941 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 71941 tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases 71943 VHS domain. Domain present in VPS-27, Hrs and STAM 71943 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 71949 Homeobox domain 71949 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 71950 Homeobox domain 71951 Glypican 71955 Eukaryotic protein of unknown function, DUF292 71960 Myosin head (motor domain) 71960 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 71966 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 71967 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 71972 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 71972 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 71972 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 71981 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 71982 PX domain. PX domains bind to phosphoinositides 71985 Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold 71985 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 71985 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 71985 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 71989 RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to 71990 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 71994 Calponin family repeat 71996 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 71998 Mitochondrial carrier protein 72000 Intermediate filament tail domain 72002 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 72002 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 72003 Synaptophysin / synaptoporin 72007 Fibronectin type III domain 72008 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 72014 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 72017 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 72023 Cytochrome b561 72026 tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs 72027 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 72033 TSC-22/dip/bun family 72040 Cadherin domain 72053 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 72054 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 72055 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 72057 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 72061 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 72068 NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5 72076 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 72077 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 72082 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 72090 GDA1/CD39 (nucleoside phosphatase) family 72094 UDP-glucoronosyl and UDP-glucosyl transferase 72096 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 72096 Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity 72106 jmjC domain 72121 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 72121 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 72121 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 72129 Peroxin 13, N-terminal domain. Both termini of the Peroxin-13 are oriented to the cytosol. Peroxin-13 is required for peroxisomal association of peroxin-14 72133 TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that 72135 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 72136 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 72137 WD domain, G-beta repeat 72137 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 72147 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 72151 Rad17 cell cycle checkpoint protein 72151 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 72152 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 72152 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 72154 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 72160 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 72162 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 72162 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 72167 ubiE/COQ5 methyltransferase family 72167 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 72167 Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity 72168 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 72169 B-box zinc finger 72175 Sugar (and other) transporter 72179 F-box domain 72183 PX domain. PX domains bind to phosphoinositides 72184 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 72184 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 72194 F-box domain 72205 HELP domain. The HELP (Hydrophobic ELP) domain is found in EMAP and EMAP-like proteins (ELPs). Although called a domain it contains a predicted transmembrane helix and may not form a globular domain. It is also not clear if these proteins localize to memb 72205 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes. 72238 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 72252 NAD-dependent glycerol-3-phosphate dehydrogenase 72258 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 72265 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 72269 Cytidine and deoxycytidylate deaminase zinc-binding region 72278 Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown 72281 SH2 domain 72287 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 72290 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 72294 7 transmembrane receptor (rhodopsin family) 72296 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 72296 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 72297 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 72303 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 72303 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 72305 Tetraacyldisaccharide-1-P 4'-kinase. This family consists of tetraacyldisaccharide-1-P 4'-kinase also known as Lipid-A 4'-kinase or Lipid A biosynthesis protein LpxK, EC:2.7.1.130. This enzyme catalyses the reaction: ATP + 2,3-bis(3-hydroxytetradecanoyl)- 72310 PMP-22/EMP/MP20/Claudin family 72318 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 72318 PH domain. PH stands for pleckstrin homology 72318 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 72325 Vacuolar sorting protein 9 (VPS9) domain 72330 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 72333 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 72344 Ubiquitin carboxyl-terminal hydrolase family 2 72344 Ubiquitin carboxyl-terminal hydrolases family 2 72349 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 72351 Protein prenyltransferase alpha subunit repeat 72361 Carboxylesterase 72373 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 72378 PH domain. PH stands for pleckstrin homology 72378 PH domain. PH stands for pleckstrin homology 72378 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 72388 Protein kinase domain 72393 Uncharacterized protein family UPF0005 72399 Zn-finger in ubiquitin-hydrolases and other protein 72413 Calcium-activated potassium channel, beta subunit 72432 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 72459 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 72461 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 72461 Aminopeptidase I zinc metalloprotease (M18) 72465 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 72469 PH domain. PH stands for pleckstrin homology 72479 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 72479 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 72480 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 72480 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 72482 Acyl CoA binding protein 72500 7 transmembrane receptor (Secretin family) 72508 Protein kinase domain 72508 Protein kinase C terminal domain 72515 WD domain, G-beta repeat 72535 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 72542 Phosphoglycerate mutase family 72542 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 72544 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 72549 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 72549 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 72552 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 72556 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 72560 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 72562 Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha) 72573 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 72584 Cullin family 72590 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 72599 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 72605 Eukaryotic-type carbonic anhydrase 72607 Ubiquitin carboxyl-terminal hydrolase family 2 72607 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 72634 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 72634 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 72650 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons 72654 Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA r 72654 Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA re 72662 RNB-like protein. The function of this region of similarity is uncertain 72667 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 72667 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 72674 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 72674 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 72685 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 72686 Ubiquitin carboxyl-terminal hydrolase family 2 72727 Glycosyltransferase family 43 72736 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 72739 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 72739 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 72739 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 72749 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 72776 Uncharacterized ACR, COG1579 72776 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 72776 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 72778 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 72790 Single-strand binding protein family 72823 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 72825 SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs 72828 Phosphoglycerate mutase family 72828 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 72828 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 72828 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 72843 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 72844 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 72852 Metallo-beta-lactamase superfamily 72895 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 72900 Respiratory-chain NADH dehydrogenase 24 Kd subunit 72925 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 72927 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 72938 Anaphase-promoting complex, subunit 10 (APC10) 72949 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 72958 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 72960 Eukaryotic DNA topoisomerase I, DNA binding fragment. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and rec 72960 Eukaryotic DNA topoisomerase I, catalytic core. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and recombina 72960 pfam02919, Topoisomer_I_N, Eukaryotic DNA topoisomerase I, DNA binding fragment. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replicat 72961 Sugar (and other) transporter 72978 Cornichon protein 72993 PH domain. PH stands for pleckstrin homology 72993 Phosphotyrosine interaction domain (PTB/PID) 73010 7 transmembrane receptor (rhodopsin family) 73062 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 73062 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 73068 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 73075 Cyclophilin type peptidyl-prolyl cis-trans isomerase 73078 Insulinase (Peptidase family M16) 73086 Protein kinase domain 73086 Protein kinase C terminal domain 73094 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 73095 Mitochondrial carrier protein 73102 Sugar (and other) transporter 73106 Trypsin 73130 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 73132 Mitochondrial carrier protein 73132 Mitochondrial carrier protein 73159 Cadherin domain 73162 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 73166 Ergosterol biosynthesis ERG4/ERG24 family 73173 Cadherin domain 73178 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 73178 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 73178 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 73181 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 73192 Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl 73218 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 73218 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 73218 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 73230 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 73244 Somatotropin hormone family 73246 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 73250 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 73251 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 73254 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 73254 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 73254 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 73296 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 73296 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 73316 Calreticulin family 73316 Calreticulin family 73327 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 73332 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 73333 Mitochondrial carrier protein 73333 Mitochondrial carrier protein 73338 Mab-21 protein 73341 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 73341 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 73353 Actin 73360 Actin 73368 Fibronectin type III domain 73368 Thrombospondin N-terminal -like domain 73368 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 73379 LCCL domain 73379 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 73382 Trypsin 73412 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 73412 Nucleoside diphosphate kinase 73430 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 73430 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 73442 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 73447 WD domain, G-beta repeat 73458 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 73547 Protein-tyrosine phosphatase 73547 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 73626 Trypsin 73626 Trypsin 73642 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 73642 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 73647 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 73649 Cytochrome b561 73656 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 73658 Sugar (and other) transporter 73674 WD domain, G-beta repeat 73680 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 73682 L1 transposable element 73682 L1 transposable element 73683 WD domain, G-beta repeat 73699 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 73699 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 73707 Protein kinase domain 73707 Adenylate and Guanylate cyclase catalytic domain 73713 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 73724 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 73732 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 73744 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 73748 Pyridoxal-dependent decarboxylase conserved domain 73804 Kinesin motor domain 73809 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 73825 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 73826 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 73834 ATP synthase subunit D. This is a family of subunit D form various ATP synthases including V-type H+ transporting and Na+ dependent. Subunit D is suggested to be an integral part of the catalytic sector of the V-ATPase 73859 Inward rectifier potassium channel 73902 Proteasome A-type and B-type 73910 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 73914 Death domain 73914 Protein kinase domain 73940 Extracellular link domain 73983 7 transmembrane receptor (metabotropic glutamate family) 73988 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 73991 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 73991 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 74002 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 74006 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 74007 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 74007 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 74008 Sulfatase 74011 Mitochondrial carrier protein 74018 Vacuolar sorting protein 9 (VPS9) domain 74020 C2 domain 74022 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 74030 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 74032 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 74038 ENV polyprotein (coat polyprotein) 74038 ENV polyprotein (coat polyprotein) 74042 jmjC domain 74042 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 74044 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 74044 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 74044 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 74051 NADP oxidoreductase coenzyme F420-dependent 74055 C2 domain 74071 Intermediate filament tail domain 74090 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 74102 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 74105 VHS domain. Domain present in VPS-27, Hrs and STAM 74105 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 74105 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 74105 Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found 74108 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 74111 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 74114 Choline/Carnitine o-acyltransferase 74121 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 74127 Intermediate filament protein 74127 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 74129 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 74134 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 74136 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 74136 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 74140 Endomembrane protein 70 74142 NB-ARC domain 74142 ATP-dependent protease La (LON) domain 74142 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 74142 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 74143 Dynamin family 74145 Transglutaminase family 74145 Transglutaminase family, C-terminal ig like domain 74145 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 74149 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 74153 Repeat in ubiquitin-activating (UBA) protein 74153 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 74153 Repeat in ubiquitin-activating (UBA) protein 74153 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 74157 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 74157 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl 74158 Josephin 74159 Acyl CoA binding protein 74176 Transglutaminase family 74176 Transglutaminase family, C-terminal ig like domain 74176 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 74180 von Willebrand factor type D domain 74180 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 74180 von Willebrand factor type D domain 74180 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 74180 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 74180 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 74182 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 74182 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 74186 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 74187 WD domain, G-beta repeat 74187 Cytochrome D1 heme domain. Cytochrome cd1 (nitrite reductase) catalyses the conversion of nitrite to nitric oxide in the nitrogen cycle. This family represents the d1 heme binding domain of cytochrome cd1, in which His/Tyr side chains ligate the d1 heme i 74188 Somatotropin hormone family 74191 7 transmembrane receptor (rhodopsin family) 74195 Radical SAM superfamily 74197 TFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits TFIIE-alpha and TFIIE-beta and joins 74199 LCCL domain 74199 von Willebrand factor type A domain 74202 LIM domain. This family represents two copies of the LIM structural domain 74205 AMP-binding enzyme 74213 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 74213 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 74213 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 74215 Trypsin 74222 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 74229 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 74237 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 74238 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 74244 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 74245 Glycosyl hydrolases family 18 74246 Glutamate/Leucine/Phenylalanine/Valine dehydrogenase 74246 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 74246 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 74246 Isoflavone reductase. This is a family of isoflavone reductases from plants. Isoflavone reductase enzymes EC:1.3.1.45 catalyse the penultimate step in the synthesis of the phytoalexin medicarpin 74246 Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI) 74246 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 74251 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 74251 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 74253 Lectin C-type domain. This family includes both long and short form C-type 74254 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 74256 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 74257 Tetraspanin family 74270 Ubiquitin carboxyl-terminal hydrolase family 2 74270 Ubiquitin carboxyl-terminal hydrolases family 2 74270 Zn-finger in ubiquitin-hydrolases and other protein 74271 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 74276 PMP-22/EMP/MP20/Claudin family 74279 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 74286 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 74306 Trypsin 74309 Oxysterol-binding protein 74315 Zinc finger, C3HC4 type (RING finger) 74316 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 74322 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 74325 Clathrin light chain 74326 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 74330 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 74338 Sodium:neurotransmitter symporter family 74352 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 74354 Guanylate kinase 74365 Zinc finger, C3HC4 type (RING finger) 74365 ATP-dependent protease La (LON) domain 74375 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 74377 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 74385 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 74409 Hyaluronidase 74410 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 74411 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 74419 Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site 74419 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 74419 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 74419 D-xylulose 5-phosphate/D-fructose 6-phosphate phosphoketolase. Bacterial enzyme splits fructose-6-P and/or xylulose-5-P with the aid of inorganic phosphate into either acetyl-P and erythrose-4-P and/or acetyl-P and glyeraldehyde-3-P EC:4.1.2.9, EC:4.1.2.2 74419 Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto 74434 Helix-loop-helix DNA-binding domain 74437 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 74438 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 74438 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 74438 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 74441 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 74441 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 74448 ADP-ribosylation factor family 74448 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 74450 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 74478 PX domain. PX domains bind to phosphoinositides 74479 PX domain. PX domains bind to phosphoinositides 74480 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 74483 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 74483 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 74486 Oxysterol-binding protein 74488 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 74490 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 74493 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 74493 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 74493 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 74499 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 74513 CUB domain 74519 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 74533 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 74551 Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate 74558 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 74558 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 74559 GNS1/SUR4 family 74561 Homeobox domain 74562 C2 domain 74563 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 74569 TPR Domain 74570 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 74571 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 74574 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 74577 Glycosyl hydrolases family 35 74581 PH domain. PH stands for pleckstrin homology 74585 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 74585 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 74589 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 74596 Phosphatidate cytidylyltransferase 74616 Peptidase family U34 74617 Serine carboxypeptidase 74649 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 74653 Protein kinase domain 74670 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 74672 Carboxylesterase 74675 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 74685 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 74686 Glycosyl hydrolases family 18 74686 Mitochondrial carrier protein 74694 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 74711 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 74718 PX domain. PX domains bind to phosphoinositides 74720 PMP-22/EMP/MP20/Claudin family 74729 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 74729 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 74732 Syntaxin 74734 ADP-ribosylation factor family 74734 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 74734 ADP-ribosylation factor family 74734 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 74735 B-box zinc finger 74753 Uncharacterised protein family UPF0066 74754 FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD bi 74762 MAM domain. An extracellular domain found in many receptors 74763 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 74769 PI3-kinase family, p85-binding domain 74769 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 74769 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 74769 Phosphatidylinositol 3- and 4-kinase 74769 PI3-kinase family, p85-binding domain 74769 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 74769 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 74769 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 74770 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 74772 E1-E2 ATPase 74772 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 74776 Inorganic pyrophosphatase 74782 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 74841 Ubiquitin carboxyl-terminal hydrolase family 2 74843 MYND finger 74851 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 74854 Glycosyltransferase family 6 74901 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 74901 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 74915 ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit 74919 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 74919 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 74943 CUB domain 74943 Sushi domain (SCR repeat) 74946 Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes 75002 Trypsin 75010 Uncharacterized protein family UPF0005 75019 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 75030 WD domain, G-beta repeat 75079 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 75079 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 75097 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 75141 ADP-ribosylation factor family 75141 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 75142 BED zinc finger 75156 Lipase/Acylhydrolase with GDSL-like motif 75196 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 75196 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 75199 Homeobox domain 75209 Sugar (and other) transporter 75210 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 75219 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 75221 Peptidase family M49 75266 MAS20 protein import receptor 75267 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 75291 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 75291 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 75292 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 75292 Protein kinase domain 75292 PH domain. PH stands for pleckstrin homology 75292 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 75305 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 75305 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 75307 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 75320 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 75320 Domain of unknown function (DUF227). This family includes a large number of drosophila proteins of unknown function. The family also includes several C. elegans proteins. The alignment contains many histidines and aspartates that are conserved, suggesting 75320 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 75352 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 75387 Sir2 family 75388 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 75396 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 75404 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 75410 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 75416 Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production 75420 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 75422 Conserved hypothetical protein 95 75422 C-5 cytosine-specific DNA methylase 75422 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 75424 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 75452 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 75456 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 75465 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 75471 ADP-ribosylation factor family 75471 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 75477 Profilin 75482 Hsp20/alpha crystallin family 75497 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 75512 Glutathione peroxidase 75516 TPR Domain 75526 WAP-type (Whey Acidic Protein) 'four-disulfide core' 75552 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 75555 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 75560 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 75572 Acylphosphatase 75578 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 75578 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 75580 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 75590 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 75590 Protein-tyrosine phosphatase 75590 Putative tyrosine phosphatase family. This family consists of putative tyrosine phosphatase proteins, this function is inferred from several sequences at the top of the noise, EC:3.1.3.48 75590 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 75593 Domain of unknown function DUF143. This domain has no known function nor do any of the proteins that possess it. The aligned region is approximately 100 amino acids long 75596 Somatotropin hormone family 75605 jmjC domain 75605 jmjN domain 75605 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 75605 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 75605 jmjN domain 75605 jmjC domain 75605 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 75605 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 75605 C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 75605 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 75612 Sulfatase 75612 Sulfatase 75617 S25 ribosomal protein 75617 S25 ribosomal protein 75625 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 75668 ADP-ribosylation factor family 75668 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 75668 ADP-ribosylation factor family 75668 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 75669 Protein kinase domain 75669 WD domain, G-beta repeat 75704 Carboxylesterase 75706 Intermediate filament protein 75712 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 75717 Cullin family 75718 von Willebrand factor type A domain 75725 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 75731 Shikimate kinase 75735 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 75739 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 75739 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 75739 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 75746 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 75746 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 75750 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f 75764 Endo/excinuclease amino terminal domain. This domain has no known function it is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE 75767 C2 domain 75769 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 75772 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of 75778 Uncharacterized protein PaaI, COG2050 75778 Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins 75785 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 75785 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 75792 WD domain, G-beta repeat 75812 Asparaginase 75828 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 75835 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 75841 Zinc finger, C3HC4 type (RING finger) 75847 Uncharacterized protein family UPF0007 75859 Glycosyltransferase family 6 75863 Lectin C-type domain. This family includes both long and short form C-type 75869 ADP-ribosylation factor family 75870 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 75894 Adenosine/AMP deaminase 75911 7 transmembrane receptor (rhodopsin family) 75986 Arginase family 76074 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 76080 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 76080 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 76080 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 76080 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 76089 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 76089 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 76089 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 76113 Animal haem peroxidase 76130 Las1-like. Las1 is an essential nuclear protein involved in cell morphogenesis and cell surface growth 76135 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 76183 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 76187 Iron-containing alcohol dehydrogenase 76206 7 transmembrane receptor (rhodopsin family) 76222 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 76242 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 76251 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 76251 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 76251 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 76251 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 76251 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 76257 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 76263 2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HC 76266 Occludin/ELL family 76266 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 76267 Fatty acid desaturase 76267 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 76279 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 76282 Aminotransferase class I and II 76299 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 76303 Oxysterol-binding protein 76303 PH domain. PH stands for pleckstrin homology 76306 LacY proton/sugar symporter. This family is closely related to the sugar transporter family 76308 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 76338 ADP-ribosylation factor family 76338 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 76366 Translation initiation factor IF-3 76373 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 76376 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 76400 Phosphatidylethanolamine-binding protein 76429 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 76441 Formin Homology 2 Domain 76459 Eukaryotic-type carbonic anhydrase 76459 Eukaryotic-type carbonic anhydrase 76469 Fibronectin type III domain 76469 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 76485 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 76487 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 76498 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 76499 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 76507 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 76507 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 76507 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 76522 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 76526 Glu-tRNAGln amidotransferase C subunit. This is a family of Glu-tRNAGln amidotransferase C subunits. The Glu-tRNA Gln amidotransferase enzyme itself is an important translational fidelity mechanism replacing incorrectly charged Glu-tRNAGln with the correc 76561 PX domain. PX domains bind to phosphoinositides 76563 Amidase 76571 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 76572 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 76577 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 76577 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 76608 Anaphase-promoting complex, subunit 10 (APC10) 76608 Anaphase-promoting complex, subunit 10 (APC10) 76608 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 76615 Aminotransferase class I and II 76630 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 76645 Galactose binding lectin domain 76645 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 76645 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 76645 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c 76646 WD domain, G-beta repeat 76652 Actin 76668 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also membe 76686 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 76688 ADP-ribosylation factor family 76703 Zinc carboxypeptidase 76703 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol 76709 Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p4 76722 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 76722 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 76737 Giardia variant-specific surface protein 76742 PX domain. PX domains bind to phosphoinositides 76742 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 76742 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 76748 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 76763 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 76763 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 76763 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 76768 Alkaline phosphatase 76770 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 76788 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 76793 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 76793 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 76795 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 76795 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 76804 jmjC domain 76804 jmjN domain 76808 Ribosomal L18ae protein family 76813 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 76834 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 76843 WD domain, G-beta repeat 76846 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 76846 S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea 76850 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 76850 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 76856 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 76856 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 76857 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 76863 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 76866 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 76867 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 76877 ADP-ribosylation factor family 76877 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 76890 Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown 76892 Branched-chain amino acid transport system / permease component. This is a large family mainly comprising high-affinity branched-chain amino acid transporter proteins such as E. coli LivH and LivM both of which are form the LIV-I transport system. Also f 76893 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 76898 Glycosyltransferase family 43 76898 Glycosyltransferase family 43 76899 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 76899 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 76901 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 76905 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 76915 Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair 76916 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 76943 Saposin A-type domain 76947 Squalene/phytoene synthase 76948 Zinc finger, C3HC4 type (RING finger) 76954 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 76954 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 76954 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 76954 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 76954 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 76954 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 76954 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 76954 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 76954 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 76965 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 76969 Sulfotransferase protein 76971 Sulfotransferase protein 76974 Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders 76980 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 76983 Sec1 family 77015 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 77018 Collagen triple helix repeat (20 copies) 77031 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 77031 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 77032 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 77038 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 77042 Hyaluronidase 77047 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 77055 Intermediate filament protein 77055 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 77057 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 77087 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 77087 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 77090 Occludin/ELL family 77113 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 77113 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 77116 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 77254 Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1 77286 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 77286 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 77305 WD domain, G-beta repeat 77318 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 77318 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 77337 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 77371 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 77407 ADP-ribosylation factor family 77407 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 77407 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 77407 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 77411 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 77432 Palmitoyl protein thioesterase 77480 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 77485 Protein kinase domain 77485 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 77485 Protein kinase domain 77485 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 77527 Cyclic nucleotide-binding domain 77531 Phosphotyrosine interaction domain (PTB/PID) 77532 CENP-B protein. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding domain, which binds to a corresponding 17bp CENP-B box sequence. In the C-terminal 59 residues, 77532 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 77559 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str 77569 LIM domain. This family represents two copies of the LIM structural domain 77573 Sec1 family 77577 Sugar (and other) transporter 77577 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 77579 Myosin head (motor domain) 77579 Myosin head (motor domain) 77579 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 77579 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 77579 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 77579 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 77579 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 77583 Pectinacetylesterase 77590 Sulfotransferase protein 77595 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 77595 Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins 77596 7 transmembrane receptor (Secretin family) 77605 Core histone H2A/H2B/H3/H4 77652 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 77652 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 77669 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 77683 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 77701 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 77704 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 77704 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 77705 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 77705 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 77739 Thrombospondin type 1 domain 77754 CDC45-like protein. CDC45 is an essential gene required for initiation of DNA replication in S. cerevisiae, forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human, mouse and smut fungus among others 77782 DNA polymerase family A 77782 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 77799 SH2 domain 77864 Yippee putative zinc-binding protein 77935 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 77951 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 77974 S-adenosyl-L-homocysteine hydrolase 77974 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 77976 Protein kinase domain 77987 Sec63 domain 77987 pfam02889, Sec63, Sec63 domain 77996 CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide vari 77998 Galactoside-binding lectin 78016 Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element 78016 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 78038 Acetyl co-enzyme A carboxylase carboxyltransferase alpha subunit. Acetyl co-enzyme A carboxylase carboxyltransferase is composed of an alpha and beta subunit 78038 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 78070 Choline/Carnitine o-acyltransferase 78076 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 78081 SCP-like extracellular protein 78081 SCP-like extracellular protein. This domain is also found in prokaryotes 78088 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 78088 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 78088 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 78134 7 transmembrane receptor (rhodopsin family) 78134 7 transmembrane receptor (rhodopsin family) 78239 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 78240 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 78246 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 78249 7 transmembrane receptor (Secretin family) 78249 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 78251 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 78255 PH domain. PH stands for pleckstrin homology 78267 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 78285 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 78285 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 78286 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 78286 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 78287 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 78294 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 78294 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis o 78304 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 78354 Phospholipase A2 inhibitor 78369 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, 78373 NUDIX domain 78388 Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein 78394 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 78394 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 78405 Nerve growth factor family 78416 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 78444 Pyroglutamyl peptidase 78455 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 78455 UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t 78507 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 78509 Plus-3 domain. This domain is about 90 residues in length and is often found associated with the pfam02213 domain. The function of this domain is uncertain. It is possible that this domain is involved in DNA binding as it has three conserved positively ch 78514 PH domain. PH stands for pleckstrin homology 78514 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 78514 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 78523 Ribosomal protein L9, N-terminal domain 78560 7 transmembrane receptor (Secretin family) 78560 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 78609 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 78609 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 78618 PH domain. PH stands for pleckstrin homology 78618 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 78618 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 78618 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 78619 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 78619 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 78625 AMP-binding enzyme 78651 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 78653 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 78658 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 78670 PH domain. PH stands for pleckstrin homology 78670 PH domain. PH stands for pleckstrin homology 78689 Mak10 subunit, NatC N(alpha)-terminal acetyltransferase. NatC N(alpha)-terminal acetyltransferases contains Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex 78697 Uncharacterized protein family UPF0024 78709 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 78753 ab-hydrolase associated lipase region 78753 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 78754 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 78771 C2 domain 78783 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 78783 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 78797 Flavodoxin 78797 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 78797 FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase 78798 WD domain, G-beta repeat 78798 HELP domain. The HELP (Hydrophobic ELP) domain is found in EMAP and EMAP-like proteins (ELPs). Although called a domain it contains a predicted transmembrane helix and may not form a globular domain. It is also not clear if these proteins localize to memb 78798 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes. 78803 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 78803 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 78808 Synaptobrevin 78808 Synaptobrevin 78816 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 78816 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 78826 7 transmembrane receptor (rhodopsin family) 78829 TSC-22/dip/bun family 78830 Mitochondrial carrier protein 78892 LCCL domain 78912 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 78912 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 78920 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 78920 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 78923 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 78923 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 78925 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 78928 Gpi16 subunit, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to ne 78928 Gpi16 subunit, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to new 78940 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H 78947 Mannosyl oligosaccharide glucosidase. This is a family of eukaryotic enzymes belonging to glycosyl hydrolase family 63. They catalyse the specific cleavage of the non-reducing terminal glucose residue from Glc(3)Man(9)GlcNAc(2). Mannosyl oligosaccharide 78951 Hsp20/alpha crystallin family 78956 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 78957 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 78959 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 78962 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 78963 NB-ARC domain 78963 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 78964 7 transmembrane receptor (rhodopsin family) 78969 Protein kinase domain 78971 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 78976 7 transmembrane receptor (rhodopsin family) 78980 FecCD transport family 78985 7 transmembrane receptor (rhodopsin family) 78986 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 78989 Lectin C-type domain. This family includes both long and short form C-type 78999 Fibronectin type III domain 78999 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 79000 Lipid-A-disaccharide synthetase. This is a family of lipid-A-disaccharide synthetases 79008 Endo/excinuclease amino terminal domain. This domain has no known function it is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE 79008 Endo/excinuclease amino terminal domain. This domain has no known function it is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE 79020 GMC oxidoreductase. This family of proteins bind FAD as a cofactor 79027 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 79031 Phosducin 79033 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 79036 Protein of unknown function DUF52 79043 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 79047 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 79048 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 79049 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk 79050 CBF/Mak21 family 79053 ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum 79054 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 79058 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 79059 Nucleoside diphosphate kinase 79065 Autophagy protein Apg9. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targeti 79071 GNS1/SUR4 family 79083 Rabphilin-3A effector domain 79085 Mitochondrial carrier protein 79087 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 79089 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 79092 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 79092 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 79095 Uncharacterised protein family (UPF0184) 79107 Animal haem peroxidase 79107 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 79113 SH2 domain 79113 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 79115 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG 79123 Amiloride-sensitive sodium channel 79124 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 79125 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 79127 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 79128 Phosphotyrosine interaction domain (PTB/PID) 79131 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 79148 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 79148 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 79148 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 79148 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 79148 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 79149 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 79152 Fatty acid hydroxylase 79152 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 79153 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 79156 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 79175 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 79177 Zinc finger 79178 Adenylate cyclase. One member was found to complement mutants of E. coli. cya. This protein has been shown to be an adenylate kinase 79183 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 79183 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 79191 Homeobox domain 79192 Homeobox domain 79196 Oxysterol-binding protein 79196 Oxysterol-binding protein 79196 PH domain. PH stands for pleckstrin homology 79201 TNFR/NGFR cysteine-rich region 79208 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 79211 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 79211 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 79212 Sodium:neurotransmitter symporter family 79213 Sodium:neurotransmitter symporter family 79214 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 79217 Connexin 79219 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 79220 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 79222 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 79224 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 79225 Helix-loop-helix DNA-binding domain 79237 Homeobox domain 79238 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 79239 Heme oxygenase 79240 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 79240 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 79242 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 79243 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 79244 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 79244 MaoC like domain. The MaoC protein is found to share similarity with a wide variety of enzymes 79244 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 79246 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 79246 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 79247 7 transmembrane receptor (rhodopsin family) 79249 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 79252 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 79252 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 79254 Ferrous iron transport protein B 79255 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 79257 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 79339 7 transmembrane receptor (rhodopsin family) 79345 7 transmembrane receptor (rhodopsin family) 79361 Syntaxin 79362 Helix-loop-helix DNA-binding domain 79363 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 79365 Helix-loop-helix DNA-binding domain 79368 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 79369 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 79369 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 79370 Apoptosis regulator proteins, Bcl-2 family 79370 Apoptosis regulator proteins, Bcl-2 family 79400 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 79411 Glycosyl hydrolases family 35 79414 Fibronectin type III domain 79414 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 79423 Stathmin family 79425 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 79430 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 79430 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 79431 Helix-loop-helix DNA-binding domain 79433 Myosin head (motor domain) 79443 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 79449 Ribosomal protein L21e 79451 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 79455 Phosducin 79462 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 79463 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1 79465 Class I Histocompatibility antigen, domains alpha 1 and 2 79541 7 transmembrane receptor (rhodopsin family) 79557 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 79557 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 79558 Protein kinase domain 79558 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 79563 GrpE 79589 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 79602 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 79603 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 79623 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 79629 Occludin/ELL family 79634 TT viral orf 1. TT virus (TTV) 79637 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 79644 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 79646 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 79648 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 79651 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 79660 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 79661 Formamidopyrimidine-DNA glycosylase 79666 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 79668 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 79679 Protein of unknown function (DUF431) 79695 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 79705 Protein kinase domain 79709 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va 79714 Ezrin/radixin/moesin family 79716 Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase 79717 DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism 79723 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 79726 WD domain, G-beta repeat 79727 'Cold-shock' DNA-binding domain 79734 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 79739 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 79748 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19- 79751 Mitochondrial carrier protein 79753 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 79754 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 79758 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 79763 Isochorismatase family. This family are hydrolase enzymes 79765 GTPase of unknown function 79770 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 79776 Homeobox domain 79777 Acyl CoA binding protein 79778 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 79781 ATPase family associated with various cellular activities (AAA) 79784 Myosin head (motor domain) 79784 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 79786 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 79786 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 79788 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 79796 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 79799 UDP-glucoronosyl and UDP-glucosyl transferase 79807 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 79813 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 79817 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 79829 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 79842 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 79856 PX domain. PX domains bind to phosphoinositides 79862 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 79871 Domain of Unknown Function (DUF408) 79874 Rabaptin 79876 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 79877 Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction 79901 Cytochrome b561 79902 Chitinase 79903 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 79912 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 79915 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 79920 F-box domain 79923 Homeobox domain 79930 PTB domain (IRS-1 type) 79935 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 79937 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 79948 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 79957 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 79958 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 79958 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 79958 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 79961 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 79961 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 79961 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 79973 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 79974 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 79984 Carboxylesterase 79990 PH domain. PH stands for pleckstrin homology 79993 GNS1/SUR4 family 80011 Bacterial surface antigen. This entry includes the following surface antigens; D15 antigen from H.influenzae 80012 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 80019 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 80024 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 80025 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 80025 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 80025 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 80025 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 80025 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 80025 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 80032 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 80036 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 80052 Ribosomal protein S5, C-terminal domain 80055 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 80068 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 80068 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 80070 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 80070 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 80086 Tubulin/FtsZ family. This family includes the tubulin alpha 80117 ADP-ribosylation factor family 80125 C2 domain 80128 B-box zinc finger 80142 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 80144 Calx-beta domain 80144 Giardia variant-specific surface protein 80144 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 80146 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 80148 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How 80153 YjeF-related protein N-terminus 80156 Aldo/keto reductase family 80168 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 80201 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 80204 F-box domain 80204 F-box domain 80204 F-box domain 80205 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 80216 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 80223 C2 domain 80230 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 80230 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 80235 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 80237 Occludin/ELL family 80262 Uncharacterised protein family (UPF0183). This family of proteins includes Lin-10 from C. elegans 80263 B-box zinc finger 80267 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 80267 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 80270 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 80271 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 80273 GrpE 80281 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 80283 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 80283 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 80290 7 transmembrane receptor (rhodopsin family) 80296 Chlorophyll A-B binding protein 80297 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 80298 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 80307 C2 domain 80307 C2 domain 80310 CUB domain 80310 CUB domain 80311 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 80311 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 80312 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 80315 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 80325 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 80325 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 80325 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 80326 wnt family 80328 Class I Histocompatibility antigen, domains alpha 1 and 2 80329 Class I Histocompatibility antigen, domains alpha 1 and 2 80331 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 80332 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 80332 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 80332 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 80332 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 80334 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 80338 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 80338 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 80339 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 80346 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 80347 Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction 80350 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 80350 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 80351 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 80704 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog 80706 7 transmembrane receptor (rhodopsin family) 80712 Homeobox domain 80714 PBX domain. The PBX domain is a bipartite acidic domain 80718 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 80720 PBX domain. The PBX domain is a bipartite acidic domain 80721 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog 80722 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 80728 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 80728 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 80732 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 80733 Eukaryotic-type carbonic anhydrase 80733 Eukaryotic-type carbonic anhydrase 80733 Eukaryotic-type carbonic anhydrase 80740 u-PAR/Ly-6 domain 80745 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 80749 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 80754 Rabaptin 80765 START domain 80765 START domain 80773 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 80774 LIM domain. This family represents two copies of the LIM structural domain 80777 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 80781 Thrombospondin N-terminal -like domain 80781 Thrombospondin N-terminal -like domain 80781 Thrombospondin N-terminal -like domain 80790 PH domain 80794 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 80794 CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co 80794 CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser 80796 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 80820 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 80821 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera 80824 Protein-tyrosine phosphatase 80824 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 80824 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 80834 7 transmembrane receptor (metabotropic glutamate family) 80834 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 80835 7 transmembrane receptor (metabotropic glutamate family) 80835 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 80835 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 80835 7 transmembrane receptor (metabotropic glutamate family) 80835 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 80835 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 80838 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 80840 7 transmembrane receptor (rhodopsin family) 80841 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 80843 eubacterial secY protein 80848 CUB domain 80848 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 80852 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 80853 jmjC domain 80854 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 80857 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 80859 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 80860 GH3 auxin-responsive promoter 80877 Beige/BEACH domain 80878 Monocarboxylate transporter 80878 Sugar (and other) transporter 80879 Monocarboxylate transporter 80883 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 80884 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 80885 7 transmembrane receptor (rhodopsin family) 80886 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 80888 Hsp20/alpha crystallin family 80890 B-box zinc finger 80890 Filamin/ABP280 repeat 80890 Zinc finger, C3HC4 type (RING finger) 80890 Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine 80890 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in at least one member, proteolysis has shown that the Peptid 80890 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph 80891 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 80892 Homeobox domain 80897 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 80898 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 80899 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 80899 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 80900 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 80900 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 80901 7 transmembrane receptor (rhodopsin family) 80903 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 80905 impB/mucB/samB family. These proteins are involved in UV protection 80908 Glycosyltransferase family 6 80910 7 transmembrane receptor (rhodopsin family) 80911 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 80978 7 transmembrane receptor (rhodopsin family) 80979 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 80981 ADP-ribosylation factor family 80985 B-box zinc finger 81003 B-box zinc finger 81003 ADP-ribosylation factor family 81003 B-box zinc finger 81003 ADP-ribosylation factor family 81003 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 81003 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 81006 7 transmembrane receptor (rhodopsin family) 81010 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 81011 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 81012 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 81013 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 81014 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 81015 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 81016 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 81017 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 81025 Connexin 81027 Tubulin/FtsZ family 81029 wnt family 81029 wnt family 81031 Sugar (and other) transporter 81033 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 81033 Cyclic nucleotide-binding domain 81034 Mitochondrial carrier protein 81285 7 transmembrane receptor (rhodopsin family) 81370 7 transmembrane receptor (rhodopsin family) 81472 7 transmembrane receptor (rhodopsin family) 81490 Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phos 81491 7 transmembrane receptor (rhodopsin family) 81494 Sushi domain (SCR repeat) 81500 Eukaryotic aspartyl protease 81502 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 81502 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 81502 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 81502 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 81503 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 81504 SH2 domain 81504 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 81505 PMP-22/EMP/MP20/Claudin family 81506 ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion 81507 Protein kinase domain 81507 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 81508 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 81509 7 transmembrane receptor (rhodopsin family) 81512 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 81514 Fibronectin type III domain 81514 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 81515 SH2 domain 81515 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 81516 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 81516 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 81517 PTN/MK heparin-binding protein family 81518 SRF-type transcription factor (DNA-binding and dimerisation domain) 81519 GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o 81521 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 81521 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 81527 CUB domain 81527 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 81529 3'5'-cyclic nucleotide phosphodiesterase 81530 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 81535 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 81536 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 81537 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 81539 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 81542 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 81551 Stathmin family 81557 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 81557 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 81557 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 81559 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 81562 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19- 81567 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 81567 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 81574 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 81578 von Willebrand factor type A domain 81578 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 81600 Glycosyl hydrolases family 18 81600 Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse 81601 MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases 81606 Phosphorylase family 81609 PX domain. PX domains bind to phosphoinositides 81617 Mo25 protein family 81618 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 81619 Tetraspanin family 81628 TSC-22/dip/bun family 81630 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 81630 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 81630 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 81631 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 81632 Aminotransferase class-III 81635 Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r 81636 Adenylate and Guanylate cyclase catalytic domain 81637 Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site 81637 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 81637 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 81638 7 transmembrane receptor (rhodopsin family) 81639 Lipoxygenase 81640 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermi 81641 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 81644 7 transmembrane receptor (rhodopsin family) 81645 7 transmembrane receptor (rhodopsin family) 81646 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 81646 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 81646 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 81646 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 81650 Casein kinase II regulatory subunit 81651 Laminin G domain 81653 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 81654 e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit 81654 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 81654 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 81657 7 transmembrane receptor (metabotropic glutamate family) 81657 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 81658 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 81658 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 81659 Protein-tyrosine phosphatase 81659 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 81661 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 81662 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 81663 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 81664 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 81666 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 81667 WD domain, G-beta repeat 81668 7 transmembrane receptor (rhodopsin family) 81672 7 transmembrane receptor (metabotropic glutamate family) 81672 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 81674 Transcription initiation factor IIF, beta subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIF (TFIIF) is a tetramer of two beta subu 81677 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 81678 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 81678 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 81679 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 81683 Macrophage migration inhibitory factor (MIF) 81684 Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but fun 81686 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 81686 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 81686 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 81687 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 81687 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 81696 7 transmembrane receptor (rhodopsin family) 81697 7 transmembrane receptor (rhodopsin family) 81707 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 81707 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 81707 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 81708 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 81710 Homeobox domain 81721 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 81721 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 81724 Methylenetetrahydrofolate reductase. This family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from bacteria and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The structure for this domain is known to be a TIM barre 81725 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 81729 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 81733 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 81734 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 81735 Fibronectin type III domain 81735 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 81737 Nerve growth factor family 81738 Conserved nucleoporin domain 81739 ATP P2X receptor 81742 3'5'-cyclic nucleotide phosphodiesterase 81743 3'5'-cyclic nucleotide phosphodiesterase 81749 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 81750 Laminin G domain 81751 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 81752 7 transmembrane receptor (rhodopsin family) 81753 7 transmembrane receptor (Secretin family) 81754 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 81755 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 81756 ADP-ribosylation factor family 81756 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 81756 Periplasmic solute binding protein family. This family includes periplasmic solute binding proteins such as TroA that interacts with an ATP-binding cassette transport system in Treponema pallidum 81756 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 81758 Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted 81758 Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve 81760 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 81761 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 81763 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 81764 Ribosomal L10 81765 Ribosomal protein L13e 81766 Eukaryotic ribosomal protein L18 81767 Ribosomal protein L19e 81768 Ribosomal L22e protein family 81769 Ribosomal protein L44 81770 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 81773 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 81774 Ribosomal protein S17 81775 Ribosomal protein S21e 81777 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 81777 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis o 81778 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 81779 7 transmembrane receptor (Secretin family) 81785 HMG (high mobility group) box 81785 Structure-specific recognition protein 81786 B-box zinc finger 81789 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 81789 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 81792 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 81792 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 81793 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 81794 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 81794 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 81796 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 81796 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 81797 7 transmembrane receptor (rhodopsin family) 81799 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 81801 Stanniocalcin family 81802 Syntaxin 81804 Sec1 family 81805 Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity 81805 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 81806 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 81808 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 81808 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 81809 Transforming growth factor beta like domain 81809 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 81810 Protein kinase domain 81811 Erythropoietin/thrombopoietin 81812 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 81813 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 81815 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 81817 Helix-loop-helix DNA-binding domain 81818 Intermediate filament protein 81819 'Paired box' domain 81820 C2 domain 81822 Actin 81826 Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor 81828 Zona pellucida-like domain 81831 CUB domain 81832 CUB domain 81832 Low-density lipoprotein receptor domain class A 81832 CUB domain 81832 CUB domain 81832 Low-density lipoprotein receptor domain class A 81840 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 81844 Zinc finger, C3HC4 type (RING finger) 81846 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 81846 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 81846 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 81849 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 81853 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 81855 Tricarboxylate carrier 81869 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 81876 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 81877 Fibronectin type III domain 81879 Sterile alpha motif (SAM)/Pointed domain 81887 Las1-like. Las1 is an essential nuclear protein involved in cell morphogenesis and cell surface growth 81894 Mitochondrial carrier protein 81896 WD domain, G-beta repeat 81897 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 81904 PMP-22/EMP/MP20/Claudin family 81905 PMP-22/EMP/MP20/Claudin family 81905 PMP-22/EMP/MP20/Claudin family 81906 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 81906 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 81910 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 81919 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 81924 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 81925 Phosphatidate cytidylyltransferase 81930 Kinesin motor domain 83259 Cadherin domain 83259 Cadherin domain 83259 Cadherin domain 83379 Glycosyl hydrolase family 1 83383 Helix-loop-helix DNA-binding domain 83394 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester 83397 Protein kinase A anchor 83398 Sulfotransferase protein 83398 Sulfotransferase protein 83401 GNS1/SUR4 family 83409 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 83410 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 83422 pfam02891, zf-MIZ, MIZ zinc finger 83422 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 83425 Protein kinase A anchor 83429 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How 83430 Interleukin-6/G-CSF/MGF family 83431 Ezrin/radixin/moesin family 83439 HMG (high mobility group) box 83447 Mitochondrial carrier protein 83453 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 83454 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl 83457 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 83462 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 83463 Helix-loop-helix DNA-binding domain 83465 Cadherin domain 83465 7 transmembrane receptor (Secretin family) 83465 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 83465 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 83465 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 83465 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 83466 Cadherin domain 83466 7 transmembrane receptor (Secretin family) 83466 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 83466 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 83468 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 83469 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 83474 HMG (high mobility group) box 83476 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 83477 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 83481 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 83493 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 83497 Occludin/ELL family 83500 Sugar (and other) transporter 83501 Cadherin domain 83501 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 83502 Cadherin domain 83502 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 83503 RNA polymerase Rpb6. Rpb6 is an essential subunit in the eukaryotic polymerases Pol I, II and III. The bacterial equivalent to Rpb6 is the omega subunit. Rpb6 and omega are structurally conserved and both function in polymerase assembly 83504 Glycosyl hydrolase family 1 83505 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 83510 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 83511 Sodium:neurotransmitter symporter family 83512 Fatty acid desaturase 83512 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 83513 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 83513 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 83513 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 83514 TSC-22/dip/bun family 83515 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 83518 7 transmembrane receptor (rhodopsin family) 83522 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 83523 Inositol monophosphatase family 83529 Eukaryotic porin 83531 Eukaryotic porin 83532 Eukaryotic porin 83533 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 83535 Inward rectifier potassium channel 83536 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 83536 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 83537 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 83539 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 83540 Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein h 83540 Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein h 83546 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 83548 Sec34-like family. Sec34 and Sec35 form a sub-complex, in a seven protein complex that includes Dor1 (PFAM:PF04124). This complex is thought to be important for tether vesicles to the Golgi 83550 7 transmembrane receptor (rhodopsin family) 83551 7 transmembrane receptor (rhodopsin family) 83561 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 83563 Ubiquitin carboxyl-terminal hydrolase family 2 83564 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 83567 7 transmembrane receptor (rhodopsin family) 83568 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 83568 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 83571 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 83574 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 83574 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 83577 von Willebrand factor type A domain 83577 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 83577 von Willebrand factor type A domain 83577 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 83581 RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu 83581 RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family 83582 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com 83583 Prenylated rab acceptor (PRA1) 83586 Helix-loop-helix DNA-binding domain 83588 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 83588 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 83589 Phosphotyrosine interaction domain (PTB/PID) 83589 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 83590 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 83592 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 83593 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 83593 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 83593 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 83595 HMG (high mobility group) box 83602 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep 83602 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep 83603 GNS1/SUR4 family 83610 Phosphotyrosine interaction domain (PTB/PID) 83610 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 83611 Phosphotyrosine interaction domain (PTB/PID) 83611 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 83612 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 83613 Intermediate filament protein 83614 pfam02891, zf-MIZ, MIZ zinc finger 83624 Cyclophilin type peptidyl-prolyl cis-trans isomerase 83627 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 83628 Tetraspanin family 83630 Homeobox domain 83632 MYND finger 83632 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 83633 7 transmembrane receptor (metabotropic glutamate family) 83633 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 83635 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 83642 Uncharacterized ACR, YdiU/UPF0061 family 83643 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 83657 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 83658 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 83658 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 83658 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 83659 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 83660 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 83661 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 83671 C2 domain 83672 C2 domain 83672 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 83672 C2 domain 83672 Hydantoinase B/oxoprolinase. This family includes N-methylhydaintoinase B which converts hydantoin to N-carbamyl-amino acids, and 5-oxoprolinase EC:3.5.2.9 which catalyses the formation of L-glutamate from 5-oxo-L-proline. These enzymes are part of the ox 83681 SH2 domain 83683 7 transmembrane receptor (rhodopsin family) 83685 C2 domain 83685 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 83686 Cyclic nucleotide-binding domain 83686 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 83688 Transaldolase 83690 LCCL domain 83691 LCCL domain 83694 MIT domain 83697 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 83702 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 83703 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 83707 Domain of unknown function DUF60. The proteins in this family have no known function. Two proteins have been annotated as phosphotransferases, however this is not supported experimentally, and should be viewed with caution 83716 LCCL domain 83719 Yippee putative zinc-binding protein 83720 Cadherin domain 83720 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 83721 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 83721 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 83721 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The 83722 POLO box duplicated region 83727 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 83728 Cadherin domain 83728 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 83730 Synaptobrevin 83731 Calcium-activated BK potassium channel alpha subunit 83731 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 83733 Mitochondrial carrier protein 83734 Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 83741 Transcription factor AP-2 83744 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 83756 7 transmembrane receptor (metabotropic glutamate family) 83756 Cell cycle protein. This entry includes the following members 83756 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 83758 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 83759 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol 83762 C2 domain 83764 Flotillin family. Flotillins are integral membrane proteins that have been shown to be present in several subcellular components, including caveolae (invaginated plasma membrane microdomains), lipid rafts (sphingolipid and cholesterol-rich, detergent-res 83765 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 83767 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 83768 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 83770 7 transmembrane receptor (metabotropic glutamate family) 83770 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 83771 7 transmembrane receptor (metabotropic glutamate family) 83771 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 83781 Galactoside-binding lectin 83782 Nucleoside diphosphate kinase 83783 Sulfotransferase protein 83788 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 83788 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 83790 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 83791 Polyprenyl synthetase 83796 BAF60b domain of the SWIB complex 83799 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 83800 Clathrin light chain 83803 pfam02922, isoamylase_N, Isoamylase N-terminal domain. This domain is found in a range of enzymes that act on branched substrates - isoamylase, pullulanase and branching enzyme. This family also contains the beta subunit of 5' AMP activated kinase 83805 SH2 domain 83805 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 83808 UDP-glucoronosyl and UDP-glucosyl transferase 83809 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 83810 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 83814 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 83817 Carboxylesterase 83819 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 83826 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 83826 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 83830 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep 83833 BAF60b domain of the SWIB complex 83842 Choline/Carnitine o-acyltransferase 83844 Ubiquitin carboxyl-terminal hydrolase family 2 83849 C2 domain 83849 C2 domain 83850 C2 domain 83851 C2 domain 83851 C2 domain 83852 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 83858 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 83873 7 transmembrane receptor (rhodopsin family) 83875 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc 83881 Homeobox domain 83882 Tetraspanin family 83884 Mitochondrial carrier protein 83885 Mitochondrial carrier protein 83887 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 83891 PX domain. PX domains bind to phosphoinositides 83892 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 83925 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d 83928 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 83931 Protein kinase domain 83931 Protein kinase domain 83937 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 83945 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 83945 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 83945 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 83954 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 83959 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 83965 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 83993 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 83995 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 83995 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 83995 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 83995 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 83996 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 83996 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 83996 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 83999 WSC domain. This domain may be involved in carbohydrate binding 83999 WSC domain. This domain may be involved in carbohydrate binding 84002 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 84005 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 84006 Protein kinase domain 84006 Protein kinase C terminal domain 84007 7 transmembrane receptor (rhodopsin family) 84008 Fibrinogen beta and gamma chains, C-terminal globular domain 84008 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 84009 PH domain. PH stands for pleckstrin homology 84009 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 84010 Delta serrate ligand 84012 Sodium:dicarboxylate symporter family 84018 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 84019 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 84020 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 84020 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 84022 7 transmembrane receptor (rhodopsin family) 84023 7 transmembrane receptor (rhodopsin family) 84024 Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine 84024 Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis 84024 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H 84028 Lipase 84028 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 84029 FMN-dependent dehydrogenase 84030 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 84030 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 84031 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 84031 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 84032 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 84032 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 84035 WSC domain. This domain may be involved in carbohydrate binding 84036 Calcium-activated SK potassium channel 84036 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 84036 Calcium-activated SK potassium channel 84036 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 84036 Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-subunits and the Ca 84036 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-s 84046 HMG (high mobility group) box 84050 Somatomedin B domain 84050 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 84057 Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair 84059 Calx-beta domain 84059 7 transmembrane receptor (Secretin family) 84059 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 84061 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 84072 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 84074 High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, gluteni 84078 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 84078 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 84079 Vacuolar sorting protein 9 (VPS9) domain 84083 HNH endonuclease 84085 F-box domain 84092 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 84100 ADP-ribosylation factor family 84100 ADP-ribosylation factor family 84101 Ubiquitin carboxyl-terminal hydrolases family 2 84101 Zn-finger in ubiquitin-hydrolases and other protein 84102 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha 84105 Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha) 84106 Vinculin family 84108 Zinc finger, C3HC4 type (RING finger) 84111 7 transmembrane receptor (rhodopsin family) 84112 7 transmembrane receptor (rhodopsin family) 84127 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 84132 Ubiquitin carboxyl-terminal hydrolase family 2 84132 Ubiquitin carboxyl-terminal hydrolases family 2 84134 Eukaryotic porin 84154 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 84159 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 84163 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 84164 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 84166 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 84171 Lysyl oxidase 84172 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com 84174 SH2 domain 84174 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 84174 SH2 domain 84174 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 84181 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 84188 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 84189 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 84193 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 84197 Protein kinase domain 84203 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 84217 MYND finger 84220 RanBP1 domain 84220 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 84225 MYND finger 84236 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 84239 E1-E2 ATPase 84239 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 84243 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 84249 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 84251 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 84253 Rap/ran-GAP 84253 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 84258 C2 domain 84259 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 84263 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 84263 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 84264 Metallo-beta-lactamase superfamily 84274 ubiE/COQ5 methyltransferase family 84275 Mitochondrial carrier protein 84277 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 84279 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 84280 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 84301 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 84306 Programmed cell death protein 2, C-terminal domain 84315 SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs 84320 Acyl CoA binding protein 84326 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 84342 Dor1-like family. Dor1 is involved in vesicle targeting to the yeast Golgi apparatus and complexes with a number of other trafficking proteins, which include Sec34 and Sec35 84347 PMP-22/EMP/MP20/Claudin family 84348 7 transmembrane receptor (rhodopsin family) 84349 TNF(Tumor Necrosis Factor) family 84350 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 84350 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 84352 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 84353 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 84355 Heavy-metal-associated domain 84357 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 84359 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 84360 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 84360 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 84364 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 84377 Somatotropin hormone family 84383 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 84384 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 84385 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 84385 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 84386 WAP-type (Whey Acidic Protein) 'four-disulfide core' 84390 Cyclic nucleotide-binding domain 84390 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 84391 Kinesin motor domain 84395 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 84396 Sodium / potassium ATPase beta chain 84399 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 84399 Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation 84401 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 84403 Connexin 84405 Interleukin-12 alpha subunit. Interleukin 12 (IL-12) is a disulphide-bonded heterodimer consisting of a 35kDa alpha subunit and a 40kDa beta subunit. It is involved in the stimulation and maintenance of Th1 cellular immune responses, including the normal 84406 Sulfotransferase protein 84407 Cadherin domain 84407 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 84410 Zinc finger 84422 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron 84423 Transforming growth factor beta like domain 84424 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 84426 wnt family 84429 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 84431 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 84435 7 transmembrane receptor (Secretin family) 84436 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 84439 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 84448 Villin headpiece domain 84448 LIM domain. This family represents two copies of the LIM structural domain 84451 Protein kinase domain 84456 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 84456 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 84457 Fibronectin type III domain 84457 Fibronectin type III domain 84464 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 84467 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 84469 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 84471 PX domain. PX domains bind to phosphoinositides 84471 Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2 84475 7 transmembrane receptor (rhodopsin family) 84476 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 84478 Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognized for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family inclu 84481 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 84484 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 84485 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 84491 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 84492 SH2 domain 84492 Protein kinase domain 84492 PH domain. PH stands for pleckstrin homology 84492 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 84492 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 84493 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 84494 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 84495 XRCC1 N terminal domain 84495 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 84496 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 84503 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 84504 Homeobox domain 84505 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 84510 Stathmin family 84511 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 84511 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 84514 GH3 auxin-responsive promoter 84517 Actin 84527 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 84528 Homeobox domain 84539 7 transmembrane receptor (rhodopsin family) 84541 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 84541 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 84550 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 84556 C2 domain 84556 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 84557 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 84557 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 84574 LIM domain. This family represents two copies of the LIM structural domain 84575 Fatty acid desaturase 84575 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 84576 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 84578 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 84583 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 84585 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 84586 Fibrinogen beta and gamma chains, C-terminal globular domain 84586 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 84588 PMP-22/EMP/MP20/Claudin family 84589 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 84590 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 84592 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 84596 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 84597 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 84599 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 84608 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 84608 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 84612 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 84614 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 84620 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 84623 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 84628 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 84630 Protein kinase domain 84631 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 84632 PH domain. PH stands for pleckstrin homology 84634 7 transmembrane receptor (rhodopsin family) 84636 7 transmembrane receptor (rhodopsin family) 84643 Kinesin motor domain 84649 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 84653 Helix-loop-helix DNA-binding domain 84656 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 84658 7 transmembrane receptor (Secretin family) 84658 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 84665 Immunoglobulin domain 84667 Helix-loop-helix DNA-binding domain 84669 Ubiquitin carboxyl-terminal hydrolase family 2 84675 B-box zinc finger 84676 B-box zinc finger 84678 jmjC domain 84678 F-box domain 84678 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 84678 F-box domain 84678 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 84679 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 84681 HIT family 84682 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 84683 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 84684 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 84689 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 84693 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 84694 Connexin 84695 Lysyl oxidase 84701 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 84717 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 84733 'chromo' (CHRromatin Organization MOdifier) domain 84749 Ubiquitin carboxyl-terminal hydrolase family 2 84750 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 84752 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 84759 Zinc finger, C3HC4 type (RING finger) 84767 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 84769 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 84812 PH domain. PH stands for pleckstrin homology 84821 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 84823 Intermediate filament protein 84823 Intermediate filament tail domain 84838 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 84838 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 84839 Homeobox domain 84844 Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs 84851 B-box zinc finger 84851 Zinc finger, C3HC4 type (RING finger) 84861 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 84868 Immunoglobulin domain 84869 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 84873 7 transmembrane receptor (Secretin family) 84873 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 84878 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 84878 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 84885 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 84888 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 84888 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 84893 F-box domain 84893 F-box domain 84909 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 84913 Helix-loop-helix DNA-binding domain 84914 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 84936 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 84938 Peptidase family C54 84938 Peptidase family C54 84946 Low temperature viability protein 84948 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 84948 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 84950 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou 84950 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou 84952 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 84954 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 84955 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 84958 C2 domain 84967 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 84971 Peptidase family C54 84975 Sugar (and other) transporter 84976 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 84992 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 84993 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 85030 Death domain 85031 Lipase 85240 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 85240 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 85242 DNA polymerase alpha subunit B. The B subunit of the DNA polymerase alpha plays an essential role at the initial stage of DNA replication in S. cerevisiae and is phosphorylated in a cell cycle-dependent manner 85243 PI3-kinase family, p85-binding domain 85243 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 85243 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 85244 IQ calmodulin-binding motif. Calmodulin-binding motif 85246 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 85247 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 85248 Kinesin motor domain 85250 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 85253 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 85255 Thiamine pyrophosphate enzyme, C-terminal TPP binding domain 85255 Thiamine pyrophosphate enzyme, N-terminal TPP binding domain 85255 Thiamine pyrophosphate enzyme, central domain. The central domain of TPP enzymes contains a 2-fold Rossman fold 85256 Sugar (and other) transporter 85257 Cyclic nucleotide-binding domain 85257 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 85258 Cyclic nucleotide-binding domain 85258 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 85259 Cyclic nucleotide-binding domain 85259 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 85260 GDA1/CD39 (nucleoside phosphatase) family 85261 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 85262 Mitochondrial carrier protein 85263 Mitochondrial carrier protein 85265 LIM domain. This family represents two copies of the LIM structural domain 85267 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells 85268 7 transmembrane receptor (rhodopsin family) 85269 Nucleoside diphosphate kinase 85270 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 85271 Zona pellucida-like domain 85272 Transforming growth factor beta like domain 85273 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 85275 Guanylate kinase 85275 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 85275 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 85275 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 85301 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 85302 Intermediate filament protein 85302 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 85308 Uncharacterised protein family (UPF0172) 85309 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 85309 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 85311 Lipase 85315 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 85327 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 85333 Mitochondrial carrier protein 85358 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 85363 B-box zinc finger 85363 Zinc finger, C3HC4 type (RING finger) 85363 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 85363 Zinc finger, C3HC4 type (RING finger) 85363 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 85363 Zinc finger, C3HC4 type (RING finger) 85363 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 85365 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 85376 Fibronectin type III domain 85376 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 85377 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 85377 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 85378 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 85382 Globin 85406 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 85413 Sugar (and other) transporter 85415 Hr1 repeat 85415 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 85419 Beige/BEACH domain 85425 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 85430 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 85431 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 85433 Zinc carboxypeptidase 85435 MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins 85435 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s 85439 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 85443 Protein kinase domain 85445 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 85445 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 85446 Homeobox domain 85453 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 85458 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The D 85461 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 85464 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 85464 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 85465 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 85472 DNA polymerase family A 85482 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 85482 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 85489 Helix-loop-helix DNA-binding domain 85491 Synaptobrevin 85493 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 85496 Somatomedin B domain 85496 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 85569 Galanin 86614 HSF-type DNA-binding 88066 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 88455 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 89122 B-box zinc finger 89122 B-box zinc finger 89776 Sugar (and other) transporter 89777 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 89778 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 89780 wnt family 89782 Leishmanolysin 89783 Golgi 4-transmembrane spanning transporter 89788 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 89789 Ergosterol biosynthesis ERG4/ERG24 family 89795 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 89797 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 89801 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 89803 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 89804 Protein kinase domain 89804 Fibronectin type III domain 89805 Uncharacterized ACR, COG1579 89805 Intermediate filament protein 89805 Fibrinogen beta and gamma chains, C-terminal globular domain 89807 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretin 89810 7 transmembrane receptor (rhodopsin family) 89813 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 89815 Synaptogyrin. This family of proteins is distantly related to pfam01284 89816 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 89818 Synaptobrevin 89819 G10 protein 89821 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 89822 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 89823 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 89824 Glycosyl hydrolases family 18 89825 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 89826 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc 89832 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 89832 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 89832 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 89832 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 89842 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 89843 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 89846 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 89846 PH domain. PH stands for pleckstrin homology 89846 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 89846 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 89848 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 89849 WD domain, G-beta repeat 89852 Tetraspanin family 89857 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 89870 B-box zinc finger 89874 Mitochondrial carrier protein 89882 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction 89883 7 transmembrane receptor (rhodopsin family) 89884 Homeobox domain 89884 LIM domain. This family represents two copies of the LIM structural domain 89890 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 89890 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 89941 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 89944 Glycosyl hydrolases family 35 89970 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 89987 Helicase. This family consists of Helicases from the Herpes viruses. Helicases are responsible for the unwinding of DNA and are essential for replication and completion of the viral life cycle 90007 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 90133 Intermediate filament protein 90134 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 90134 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 90135 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 90139 Tetraspanin family 90167 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 90193 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 90199 WAP-type (Whey Acidic Protein) 'four-disulfide core' 90199 WAP-type (Whey Acidic Protein) 'four-disulfide core' 90199 WAP-type (Whey Acidic Protein) 'four-disulfide core' 90199 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 90203 PX domain. PX domains bind to phosphoinositides 90203 PX domain. PX domains bind to phosphoinositides 90249 Death domain 90249 Thrombospondin type 1 domain 90249 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 90293 BTB/POZ domain. The BTB (for BR-C 90293 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Kelch is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Sev 90317 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 90321 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 90342 C2 domain 90353 Uncharacterized protein family UPF0021 90378 Sterile alpha motif (SAM)/Pointed domain 90378 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 90408 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 90423 ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit 90462 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 90470 Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate 90522 Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1 90525 SH2 domain 90576 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 90592 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 90594 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 90627 START domain 90627 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 90673 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 90678 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 90701 Signal peptidase I 90705 Ribosomal protein L21e 90780 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 90787 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 90827 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 90864 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 90906 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 90906 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 90933 B-box zinc finger 90987 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 91012 Homeobox domain 91012 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 91107 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 91107 B-box zinc finger 91107 Zinc finger, C3HC4 type (RING finger) 91107 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 91115 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 91133 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 91137 Mitochondrial carrier protein 91151 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 91151 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 91156 Fibronectin type III domain 91181 Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins 91227 Gamma-glutamyltranspeptidase 91351 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 91355 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 91369 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 91373 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid 91408 NAC domain 91452 Acyl CoA binding protein 91461 Protein kinase domain 91464 Homeobox domain 91543 Radical SAM superfamily 91561 Ribosomal protein S5, C-terminal domain 91561 Ribosomal protein S5, N-terminal domain 91584 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 91646 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 91646 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 91653 Fibronectin type III domain 91661 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 91662 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 91746 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 91807 Protein kinase domain 91851 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 91893 Ferredoxin-fold anticodon binding domain. This is the anticodon binding domain found in some phenylalanyl tRNA synthetases. The domain has a ferredoxin fold 91917 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 92014 Mitochondrial carrier protein 92017 PX domain. PX domains bind to phosphoinositides 92086 Gamma-glutamyltranspeptidase 92086 Gamma-glutamyltranspeptidase 92086 Gamma-glutamyltranspeptidase 92105 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 92181 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 92211 Cadherin domain 92267 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 92370 Histidine acid phosphatase 92497 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 92552 Josephin 92597 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 92606 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 92609 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 92610 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 92610 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 92714 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp 92736 Protein of unknown function, DUF270 92736 Protein of unknown function, DUF270 92745 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 92755 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 92755 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 92799 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 92806 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 92840 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 92856 IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing 92912 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 93010 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 93035 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 93035 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 93107 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 93107 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 93107 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 93166 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 93210 Per1-like. A member of this family has been implemented in protein processing in the endoplasmic reticulum 93225 Ribosomal protein L15 93273 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 93333 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 93349 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 93349 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 93395 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 93432 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 93436 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 93474 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 93517 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 93550 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 93550 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i 93587 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 93589 Cache domain 93594 Caldesmon 93611 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 93624 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 93649 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 93649 RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impl 93650 Histidine acid phosphatase 93650 Histidine acid phosphatase 93661 F-actin capping protein alpha subunit 93662 Cadherin domain 93672 Interleukin 10 93673 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 93674 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 93677 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 93685 GDA1/CD39 (nucleoside phosphatase) family 93687 Protein kinase domain 93688 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 93688 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 93689 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 93690 7 transmembrane receptor (rhodopsin family) 93692 Glutaredoxin 93710 Cadherin domain 93714 Cadherin domain 93715 Cadherin domain 93721 Zinc carboxypeptidase 93723 Cadherin domain 93728 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 93728 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 93732 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 93734 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 93735 wnt family 93739 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 93746 7 transmembrane receptor (metabotropic glutamate family) 93747 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 93747 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 93749 Chaperonin 10 Kd subunit 93757 Signal peptidase I 93760 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 93761 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 93762 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 93762 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 93762 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 93762 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 93837 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 93841 Ubiquitin carboxyl-terminal hydrolase, family 1 93842 Fibronectin type III domain 93873 Cadherin domain 93874 Cadherin domain 93875 Cadherin domain 93876 Cadherin domain 93877 Cadherin domain 93878 Cadherin domain 93879 Cadherin domain 93880 Cadherin domain 93881 Cadherin domain 93882 Cadherin domain 93883 Cadherin domain 93885 Cadherin domain 93888 Cadherin domain 93889 Cadherin domain 93892 Cadherin domain 93896 7 transmembrane receptor (Secretin family) 93896 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 93897 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 93898 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 93953 Putative metallopeptidase (SprT family). This family of uncharacterised proteins may be zinc metallopeptidases 93961 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 93979 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 94030 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 94033 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 94039 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 94045 ATP P2X receptor 94045 ATP P2X receptor 94056 BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function 94061 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 94062 Ribosomal protein L3 94081 Tricarboxylate carrier 94086 Hsp20/alpha crystallin family 94088 B-box zinc finger 94088 Zinc finger, C3HC4 type (RING finger) 94088 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 94089 B-box zinc finger 94090 B-box zinc finger 94090 Fibronectin type III domain 94090 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 94091 B-box zinc finger 94091 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 94092 B-box zinc finger 94094 B-box zinc finger 94094 Zinc finger, C3HC4 type (RING finger) 94097 Tricarboxylate carrier 94103 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 94106 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 94109 CUB domain 94109 Sushi domain (SCR repeat) 94120 C2 domain 94120 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 94121 C2 domain 94121 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 94122 C2 domain 94122 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 94134 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 94160 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 94160 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 94160 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 94164 Globin 94168 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 94173 Isocitrate/isopropylmalate dehydrogenase 94174 Papain family cysteine protease 94175 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 94178 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 94179 Intermediate filament protein 94185 Death domain 94191 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 94192 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 94193 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 94193 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 94193 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 94194 Cytochrome c oxidase subunit Vb 94195 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 94198 Proteasome A-type and B-type 94199 Renal dipeptidase 94200 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 94214 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 94215 UDP-glucoronosyl and UDP-glucosyl transferase 94216 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 94217 Low-density lipoprotein receptor domain class A 94217 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 94221 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 94222 Helix-loop-helix DNA-binding domain 94224 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 94226 7 transmembrane receptor (rhodopsin family) 94229 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 94230 CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p 94232 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 94233 7 transmembrane receptor (rhodopsin family) 94234 Fork head domain 94242 Papain family cysteine protease 94249 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 94253 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 94266 Ribosomal protein S27 94267 NUDIX domain 94268 Ephrin 94275 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 94279 Tricarboxylate carrier 94280 Tricarboxylate carrier 94281 Tricarboxylate carrier 94282 Tricarboxylate carrier 94282 Tricarboxylate carrier 94284 UDP-glucoronosyl and UDP-glucosyl transferase 94284 UDP-glucoronosyl and UDP-glucosyl transferase 94338 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN 94339 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 94341 Inward rectifier potassium channel 94342 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 94352 Lysyl oxidase 94352 Lysyl oxidase 94352 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 94468 3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin 95803 Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the tyr 96764 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 96935 Sushi domain (SCR repeat) 96957 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 97086 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 97086 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 97114 Core histone H2A/H2B/H3/H4 97165 HMG (high mobility group) box 97165 HMG (high mobility group) box 97212 3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin 97212 3-hydroxyacyl-CoA dehydrogenase, C-terminal domain. This family also includes lambda crystallin. Some proteins include two copies of this domain 97212 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 97440 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 97484 Dor1-like family. Dor1 is involved in vesicle targeting to the yeast Golgi apparatus and complexes with a number of other trafficking proteins, which include Sec34 and Sec35 97487 Synaptogyrin. This family of proteins is distantly related to pfam01284 97541 tRNA synthetases class I (E and Q), anti-codon binding domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase a 97541 tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyla 97795 Laminin N-terminal (Domain VI) 97827 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi 97884 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 98027 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 98256 Monooxygenase. This family includes diverse enzymes that utilise FAD 98258 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 98366 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 98386 Ergosterol biosynthesis ERG4/ERG24 family 98388 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 98396 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha 98396 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha 98396 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples hav 98402 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 98417 Cornichon protein 98432 PH domain. PH stands for pleckstrin homology 98432 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 98660 E1-E2 ATPase 98660 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 98660 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 98660 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 98682 LacY proton/sugar symporter. This family is closely related to the sugar transporter family 98711 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 98733 Fibronectin type III domain 98733 Fibronectin type III domain 98733 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 98741 Kv2 voltage-gated K+ channel 98741 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 98835 Integral membrane protein DUF106. This archaebacterial protein family has no known function. Members are predicted to be integral membrane proteins 98848 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 98848 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fo 98870 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 98870 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f 98910 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 98932 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 99010 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 99031 Oxysterol-binding protein 99151 Anp1. The members of this family (Anp1, Van1 and Mnn9) are membrane proteins required for proper Golgi function. These proteins colocalize within the cis Golgi, and that they are physically associated in two distinct complexes 99151 Anp1. The members of this family (Anp1, Van1 and Mnn9) are membrane proteins required for proper Golgi function. These proteins co-localize within the cis Golgi, and that they are physically associated in two distinct complexes 99151 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of var 99152 Cullin family 99237 Endomembrane protein 70 99296 7 transmembrane receptor (rhodopsin family) 99326 Rap/ran-GAP 99326 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 99334 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 99375 Cullin family 99377 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 99377 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 99382 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 99382 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 99382 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 99439 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 99439 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 99439 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous 99512 WD domain, G-beta repeat 99543 Olfactomedin-like domain 99571 Fibrinogen beta and gamma chains, C-terminal globular domain 99633 7 transmembrane receptor (Secretin family) 99633 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 99633 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein coup 99650 6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each 99683 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 99689 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 99689 TT viral orf 1. TT virus (TTV), isolated initially from a Japanese patient with hepatitis of unknown aetiology, has since been found to infect both healthy and diseased individuals and numerous prevalence studies have raised questions about its role in un 99689 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 99696 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 99696 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 99730 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histon 99738 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 99738 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 99929 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 99929 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 99929 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 99929 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 100061 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 100066 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 100090 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 100102 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 100121 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 100129 7 transmembrane receptor (rhodopsin family) 100163 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 100198 Glucose-6-phosphate dehydrogenase, C-terminal domain 100198 Glucose-6-phosphate dehydrogenase, NAD binding domain 100198 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 100206 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi 100206 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisin 100210 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 100213 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 100273 Oxysterol-binding protein 100273 Oxysterol-binding protein 100336 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 100383 BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function 100383 BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function 100465 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 100465 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc 100559 UDP-glucoronosyl and UDP-glucosyl transferase 100561 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 100608 CBF/Mak21 family 100727 UDP-glucoronosyl and UDP-glucosyl transferase 100732 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 100732 EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the 100737 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 100756 Ubiquitin carboxyl-terminal hydrolase family 2 100756 Ubiquitin carboxyl-terminal hydrolases family 2 100763 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 100929 Flavodoxin 100929 Radical SAM superfamily 100951 Helix-loop-helix DNA-binding domain 100972 ADP-ribosylation factor family 100972 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 101023 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 101185 Telomere-binding protein alpha subunit, N-terminal domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. A 101187 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 101202 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 101240 WD domain, G-beta repeat 101320 Protein kinase domain 101358 F-box domain 101488 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 101488 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 101488 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 101488 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 101490 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 101502 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 101513 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 101533 Trypsin 101540 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 101568 Protein kinase domain 101592 Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold 101592 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 101613 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 101700 B-box zinc finger 101700 Zinc finger, C3HC4 type (RING finger) 101706 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 101739 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 101744 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte 101744 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 101860 Sulfotransferase protein 101861 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 101861 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 101943 CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p 101964 Sterile alpha motif (SAM)/Pointed domain 101964 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 101994 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 102022 Carboxylesterase 102060 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 102075 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 102141 PX domain. PX domains bind to phosphoinositides 102141 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 102278 C2 domain 102294 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 102448 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 102502 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 102570 Sugar (and other) transporter 102580 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 102607 PX domain. PX domains bind to phosphoinositides 102626 Protein kinase domain 102680 Sodium:neurotransmitter symporter family 102774 TPR Domain 102857 Sodium:neurotransmitter symporter family 102926 Peptidase family C54 102926 Peptidase family C54 103098 Sodium:neurotransmitter symporter family 103135 Ubiquitin carboxyl-terminal hydrolase family 2 103135 Ubiquitin carboxyl-terminal hydrolase family 2 103135 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 103142 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 103149 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 103199 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 103236 Protein kinase domain 103296 Protein kinase domain 103468 Nuclear pore protein 84 / 107. Nup84p forms a complex with five proteins, of which Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis 103468 Nuclear pore protein 84 / 107. Nup84p forms a complex with five proteins, of which Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis 103537 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino te 103655 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 103694 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 103775 Mitochondrial carrier protein 103784 WD domain, G-beta repeat 103841 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 103968 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 103968 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 103978 Glypican 103978 Glypican 103988 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 104001 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 104009 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 104010 Cadherin domain 104010 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 104069 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 104082 WD domain, G-beta repeat 104086 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 104099 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 104110 Adenylate and Guanylate cyclase catalytic domain 104111 Adenylate and Guanylate cyclase catalytic domain 104112 Citrate synthase 104112 CoA binding domain. This domain has a Rossmann fold and is found in a number of proteins including succinyl CoA synthetases, malate and ATP-citrate ligases 104112 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 104156 Ets-domain 104158 Carboxylesterase 104174 Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated 104183 Glycosyl hydrolases family 18 104184 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 104184 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 104245 Sodium:neurotransmitter symporter family 104263 jmjC domain 104303 ADP-ribosylation factor family 104318 Protein kinase domain 104318 Protein kinase domain 104348 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 104348 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 104349 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 104360 LIM domain. This family represents two copies of the LIM structural domain 104383 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 104384 Homeobox domain 104394 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 104416 Ubiquitin carboxyl-terminal hydrolase, family 1 104418 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 104418 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 104418 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 104443 7 transmembrane receptor (rhodopsin family) 104625 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 104652 Myosin head (motor domain) 104718 Coatomer WD associated domain 104759 Phospholipase D. Active site motif. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certa 104776 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 104816 Asparaginase 104910 Mitochondrial carrier protein 104923 ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be 105014 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 105033 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 105072 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 105245 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 105246 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 105349 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 105372 WD domain, G-beta repeat 105387 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 105418 RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family includ 105446 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 105446 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 105540 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 105540 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 105594 Homeobox domain 105594 Homeobox domain 105663 Adenylate cyclase. One member was found to complement mutants of E. coli. cya. This protein has been shown to be an adenylate kinase 105675 Cyclophilin type peptidyl-prolyl cis-trans isomerase 105727 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 105734 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 105734 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 105734 CENP-B protein. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding domain, which binds to a corresponding 17bp CENP-B box sequence. In the C-terminal 59 residues, 105734 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di 105734 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 105785 ER lumen protein retaining receptor 105787 Protein kinase domain 105835 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 105866 Intermediate filament protein 105887 UDP-glucoronosyl and UDP-glucosyl transferase 105892 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 106039 VHS domain. Domain present in VPS-27, Hrs and STAM 106039 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 106039 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 106052 F-box domain 106112 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 106112 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 106151 Oxysterol-binding protein 106200 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 106200 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 106248 Queuine tRNA-ribosyltransferase. This is a family of queuine tRNA-ribosyltransferases EC:2.4.2.29, also known as tRNA-guanine transglycosylase and guanine insertion enzyme. Queuine tRNA-ribosyltransferase modifies tRNAs for asparagine, aspartic acid, hist 106326 Oxysterol-binding protein 106369 Yippee putative zinc-binding protein 106407 Domain of unknown function 106522 Protein kinase domain 106529 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 106564 DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism 106572 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 106583 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 106597 S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure 106618 WD domain, G-beta repeat 106628 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 106648 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 106794 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 106794 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 106794 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 106795 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 106877 PH domain. PH stands for pleckstrin homology 106877 PH domain. PH stands for pleckstrin homology 106947 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 106952 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 106957 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 107141 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 107182 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 107182 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 107182 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 107221 7 transmembrane receptor (rhodopsin family) 107221 7 transmembrane receptor (rhodopsin family) 107250 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 107250 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 107257 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 107271 tRNA synthetases class I (W and Y) 107271 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte act 107271 tRNA synthetases class I (W and Y) 107271 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti 107272 Aminotransferase class-V 107305 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 107321 LIM domain. This family represents two copies of the LIM structural domain 107328 Domain of unknown function DUF60. The proteins in this family have no known function. Two proteins have been annotated as phosphotransferases, however this is not supported experimentally, and should be viewed with caution 107338 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 107351 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 107351 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 107358 Endomembrane protein 70 107358 Endomembrane protein 70 107368 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 107371 Exocyst complex subunit Sec15-like 107375 Mitochondrial carrier protein 107393 Lectin C-type domain. This family includes both long and short form C-type 107393 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 107476 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 107508 WHEP-TRS domain 107508 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 107508 tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases 107508 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 107508 tRNA synthetases class I (E and Q), anti-codon binding domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase a 107508 tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyla 107513 Translocon-associated protein (TRAP), alpha subunit. The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across 107527 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 107528 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 107568 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 107568 C2 domain 107568 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 107568 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 107581 Thrombospondin N-terminal -like domain 107585 Iodothyronine deiodinase 107589 Protein kinase domain 107589 Fibronectin type III domain 107589 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 107605 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 107607 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 107607 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 107607 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 107607 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 107650 Phosphatidylinositol 3- and 4-kinase 107652 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uri 107684 WD domain, G-beta repeat 107686 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 107702 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 107723 K-Cl Co-transporter type 1 (KCC1) 107747 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 107751 OAR domain 107751 Homeobox domain 107753 Galactoside-binding lectin 107767 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cy 107771 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 107797 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 107798 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 107817 jmjC domain 107823 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 107823 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 107823 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 107831 7 transmembrane receptor (Secretin family) 107831 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 107831 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 107869 Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OA 107885 5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF 107889 GCM motif protein 107932 'chromo' (CHRromatin Organization MOdifier) domain 107932 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 107932 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 107932 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 107934 7 transmembrane receptor (Secretin family) 107934 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 107934 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 107970 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 107971 PTB domain (IRS-1 type) 108000 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 108000 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 108000 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 108011 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 108011 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 108012 Clathrin adaptor complex small chain 108013 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 108013 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 108015 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 108015 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 108015 Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 108015 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 108015 Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 108015 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 108017 ATP1G1/PLM/MAT8 family 108030 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 108030 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 108037 Serine hydroxymethyltransferase 108043 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 108043 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 108043 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 108043 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 108052 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 108062 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 108062 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 108067 Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars 108067 Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <=> diphosphate + CMP-3-deoxy-D-manno-octu 108069 7 transmembrane receptor (metabotropic glutamate family) 108069 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 108069 7 transmembrane receptor (metabotropic glutamate family) 108071 7 transmembrane receptor (metabotropic glutamate family) 108071 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 108072 7 transmembrane receptor (metabotropic glutamate family) 108072 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 108073 7 transmembrane receptor (metabotropic glutamate family) 108073 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 108075 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 108075 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 108077 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 108079 Protein kinase domain 108083 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 108089 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 108096 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 108096 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 108105 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 108114 Sugar (and other) transporter 108115 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 108115 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 108116 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 108116 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 108124 NSF attachment protein 108143 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tra 108148 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 108150 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 108151 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 108151 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 108154 Thrombospondin type 1 domain 108154 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 108154 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 108156 Formate--tetrahydrofolate ligase 108159 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 108159 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 108657 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 108660 Zinc finger 108664 V-ATPase subunit H 108672 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 108686 Intermediate filament protein 108686 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 108687 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 108699 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 108699 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 108699 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 108723 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 108760 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 108800 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 108800 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 108802 Calreticulin family 108837 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 108837 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 108841 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 108841 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 108853 PCRF domain. This domain is found in peptide chain release factors 108853 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons 108888 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 108888 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 108898 NAD-dependent glycerol-3-phosphate dehydrogenase 108898 NAD-dependent glycerol-3-phosphate dehydrogenase 108943 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 108946 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 108960 Death domain 108960 Protein kinase domain 108960 Death domain 108960 Protein kinase domain 108995 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 109032 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 109032 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 109052 Intermediate filament protein 109052 Intermediate filament protein 109052 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 109052 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 109075 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 109093 tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase 109108 XPA protein 109108 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 109108 XPA protein 109108 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 109113 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 109113 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 109113 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 109115 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histon 109115 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone 109137 ADP-ribosylation factor family 109137 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 109161 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 109161 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 109168 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 109225 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 109225 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 109232 Saccharopine dehydrogenase. This family comprised of three structural domains that can not be separated in the linear sequence. In some organisms this enzyme is found as a bifunctional polypeptide with lysine ketoglutarate reductase (PF). The saccharopine 109246 Tetraspanin family 109246 Tetraspanin family 109246 Tetraspanin family 109264 Malic enzyme, NAD binding domain 109264 Malic enzyme, N-terminal domain 109264 Malic enzyme, NAD binding domain 109267 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 109272 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 109280 Sugar (and other) transporter 109310 Ubiquitin carboxyl-terminal hydrolase family 2 109323 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 109331 Zinc finger, C3HC4 type (RING finger) 109333 Hr1 repeat 109333 Protein kinase domain 109333 Protein kinase C terminal domain 109342 Sodium:solute symporter family 109346 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 109346 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 109594 LIM domain. This family represents two copies of the LIM structural domain 109596 IQ calmodulin-binding motif. Calmodulin-binding motif 109624 Caldesmon 109637 Tetraspanin family 109648 Pancreatic hormone peptide 109652 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 109672 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 109674 Adenosine/AMP deaminase 109676 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 109676 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 109676 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 109685 Hyaluronidase 109685 Hyaluronidase 109685 RolB/RolC glucosidase family. This family of proteins includes RolB and RolC. RolC releases cytokinins from glucoside conjugates. Whereas RolB hydrolyses indole glucosides 109689 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 109689 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 109689 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 109689 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 109697 Zinc carboxypeptidase 109697 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol 109711 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 109711 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 109731 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 109754 Oxidoreductase FAD-binding domain 109778 Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD 109801 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 109820 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 109857 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 109880 B-box zinc finger 109880 Protein kinase domain 109901 Trypsin 109904 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 109929 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 109929 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 109978 NAD:arginine ADP-ribosyltransferase 109979 NAD:arginine ADP-ribosyltransferase 110006 Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 110006 Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold 110052 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 110058 C2 domain 110074 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 110075 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 110078 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 110082 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 110095 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 110109 NOL1/NOP2/sun family 110119 Phosphomannose isomerase type I 110135 Fibrinogen beta and gamma chains, C-terminal globular domain 110147 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 110147 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 110147 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 110147 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 110157 Protein kinase domain 110157 Raf-like Ras-binding domain 110157 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 110168 7 transmembrane receptor (rhodopsin family) 110196 Polyprenyl synthetase 110197 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 110197 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 110213 Uncharacterized protein family UPF0005 110253 PH domain. PH stands for pleckstrin homology 110279 C2 domain 110279 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 110279 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 110304 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 110304 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 110308 Intermediate filament protein 110310 Intermediate filament protein 110323 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 110326 7 transmembrane receptor (metabotropic glutamate family) 110326 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 110351 Rap/ran-GAP 110351 LGN motif, putative GEF specific for G-alpha GTPase 110355 Protein kinase domain 110355 PH domain. PH stands for pleckstrin homology 110355 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 110382 Low-density lipoprotein receptor domain class A 110382 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 110382 Thrombospondin type 1 domain 110382 Low-density lipoprotein receptor domain class A 110382 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl 110446 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 110521 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 110524 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 110524 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 110524 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 110532 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 110542 Protein kinase domain 110542 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 110595 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 110596 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 110616 Josephin 110637 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 110637 Bacterial extracellular solute-binding proteins, family 3 110637 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 110637 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 110648 Homeobox domain 110648 LIM domain. This family represents two copies of the LIM structural domain 110695 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 110750 CAS/CSE protein, C-terminus. Mammalian cellular apoptosis susceptibility (CAS) proteins are homologous to the yeast chromosome-segregation protein, CSE1. This family aligns the C-terminal halves (approximately). CAS is involved in both cellular apoptosis 110751 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 110751 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 110789 Calx-beta domain 110805 Fork head domain 110805 Ribosomal L29e protein family 110809 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 110816 Periodic tryptophan protein 2 WD repeat associated domain 110816 WD domain, G-beta repeat 110816 Periodic tryptophan protein 2 WD repeat associated domain 110829 LIM domain. This family represents two copies of the LIM structural domain 110834 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 110834 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 110835 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 110835 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 110835 Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 110835 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 110854 Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is 110855 3'5'-cyclic nucleotide phosphodiesterase 110862 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 110862 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 110880 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 110886 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 110886 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 110886 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 110886 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 110891 Calx-beta domain 110891 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 110893 Calx-beta domain 110893 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 110902 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 110902 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 110911 Phosphatidate cytidylyltransferase 110920 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 110920 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 110948 Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for 110954 Ribosomal L10 110956 ENV polyprotein (coat polyprotein) 110956 ENV polyprotein (coat polyprotein) 111174 7 transmembrane receptor (rhodopsin family) 111175 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 112400 Neuregulin family 112417 UDP-glucoronosyl and UDP-glucosyl transferase 112483 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 112574 PX domain. PX domains bind to phosphoinositides 112714 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 112714 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 112755 Syntaxin 112802 Intermediate filament protein 112885 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 112936 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp 112937 Glycosyl hydrolases family 35 112939 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 113026 PH domain. PH stands for pleckstrin homology 113178 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 113189 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 113220 Kinesin motor domain 113251 GTPase of unknown function 113419 Protein of unknown function (DUF396). A family of conserved eukaryotic transmembrane proteins 113451 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 113457 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 113457 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 113612 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 113622 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi 113730 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 113829 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 113845 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113846 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113847 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113848 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113849 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113849 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113849 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113850 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113850 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113850 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113851 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113851 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113852 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113852 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113852 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113853 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113853 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113854 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113855 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113856 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113857 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113858 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113859 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113860 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113861 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113861 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113862 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113863 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113863 7 transmembrane receptor (rhodopsin family) 113863 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113864 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113864 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113865 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113867 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 113886 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 113892 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 113893 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid 113894 Octicosapeptide repeat. Short motif that may bind Ca2+ 113894 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 113901 Glycosyl hydrolases family 18 113901 Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse 113902 Carboxylesterase 113906 Hsp20/alpha crystallin family 113910 Prion/Doppel alpha-helical domain. The prion protein is thought to be the infectious agent that causes transmissible spongiform encephalopathies, such as scrapie and BSE. It is thought that the prion protein can exist in two different forms: one is the no 113911 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 113914 7 transmembrane receptor (rhodopsin family) 113918 Sodium:neurotransmitter symporter family 113927 Protein kinase domain 113936 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo 113940 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 113940 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 113947 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 113956 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 113958 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 113959 7 transmembrane receptor (rhodopsin family) 113960 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 113960 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 113970 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 113983 Homeobox domain 113984 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 113984 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 113992 UDP-glucoronosyl and UDP-glucosyl transferase 113995 ATP P2X receptor 114002 Sulfotransferase protein 114020 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 114022 Glutaredoxin 114023 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 114027 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 114032 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 114032 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 114087 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 114088 Fibronectin type III domain 114088 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 114088 Fibronectin type III domain 114089 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 114091 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 114094 Ubiquitin carboxyl-terminal hydrolase, family 1 114095 Sec1 family 114096 Isocitrate/isopropylmalate dehydrogenase 114097 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 114098 7 transmembrane receptor (rhodopsin family) 114099 7 transmembrane receptor (rhodopsin family) 114100 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. 114101 Phosphatidate cytidylyltransferase 114102 Cadherin domain 114105 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 114106 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc 114107 WSC domain. This domain may be involved in carbohydrate binding 114108 LIM domain. This family represents two copies of the LIM structural domain 114109 Homeobox domain 114112 Pyridine nucleotide-disulphide oxidoreductase, dimerisation domain. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases 114112 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 114113 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu 114114 Dynamin GTPase effector domain 114114 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 114115 ATP P2X receptor 114116 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 114116 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 114119 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 114124 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 114128 Golgi 4-transmembrane spanning transporter 114134 Sugar (and other) transporter 114141 PMP-22/EMP/MP20/Claudin family 114142 Fork head domain 114213 Homeobox domain 114214 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 114216 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 114228 Trypsin 114229 7 transmembrane receptor (rhodopsin family) 114243 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 114244 Cyclic nucleotide-binding domain 114244 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 114245 Cyclic nucleotide-binding domain 114245 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 114246 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 114299 Paralemmin 114301 Paralemmin 114304 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin 114332 Extracellular link domain 114335 Cystine-knot domain 114336 Cystine-knot domain 114481 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 114482 Protein kinase domain 114482 Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element 114493 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 114494 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 114494 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 114497 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 114499 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 114500 Homeobox domain 114502 Homeobox domain 114502 'Paired box' domain 114503 Homeobox domain 114504 Homeobox domain 114507 Sodium:solute symporter family 114508 Fructose-1-6-bisphosphatase 114513 PH domain. PH stands for pleckstrin homology 114513 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 114516 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 114516 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 114516 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 114521 Uracil DNA glycosylase superfamily 114522 Stromal antigen (SA/STAG) protein 114523 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 114524 Laminin N-terminal (Domain VI) 114524 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 114548 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 114553 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 114555 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 114558 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either asso 114561 Phosphatidylinositol transfer protein 114562 Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90 114564 7 transmembrane receptor (rhodopsin family) 114564 7 transmembrane receptor (rhodopsin family) 114565 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 114566 Intermediate filament protein 114570 LIM domain. This family represents two copies of the LIM structural domain 114570 LIM domain. This family represents two copies of the LIM structural domain 114571 Sugar (and other) transporter 114589 Fibronectin type III domain 114590 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 114591 Peptidase family M49 114593 Atrial natriuretic peptide 114598 Lectin C-type domain. This family includes both long and short form C-type 114601 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 114601 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 114602 MYND finger 114629 Sulfate transporter family. Mutations may lead to several human diseases 114629 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 114629 Sulfate transporter family. Mutations may lead to several human diseases 114629 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 114630 ATP synthase subunit C 114632 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti 114633 C2 domain 114633 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 114634 Homeobox domain 114639 Zona pellucida-like domain 114641 Ribosomal protein L31e 114644 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 114663 Inositol monophosphatase family 114664 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 114664 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 114671 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 114671 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 114674 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 114675 Caldesmon 114675 WD domain, G-beta repeat 114675 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 114700 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 114705 Raf-like Ras-binding domain 114712 Helix-loop-helix DNA-binding domain 114757 Globin 114767 Protein-tyrosine phosphatase 114767 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 114769 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 114780 Galactose binding lectin domain 114780 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 114780 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 114780 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c 114780 Galactose binding lectin domain 114780 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 114780 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c 114780 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 114780 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 114781 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 114782 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 114783 Protein kinase domain 114784 CUB domain 114784 Sushi domain (SCR repeat) 114784 Sushi domain (SCR repeat) 114788 CUB domain 114788 Sushi domain (SCR repeat) 114788 Sushi domain (SCR repeat) 114789 Mitochondrial carrier protein 114791 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 114792 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 114794 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 114798 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 114803 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 114805 QXW lectin repeat 114805 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 114814 7 transmembrane receptor (rhodopsin family) 114815 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 114819 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 114821 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 114821 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 114822 Hr1 repeat 114822 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 114823 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 114825 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 114826 MYND finger 114845 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 114846 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 114851 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 114856 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 114859 Protein kinase domain 114861 Serine carboxypeptidase 114874 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester 114874 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera 114876 Oxysterol-binding protein 114876 Oxysterol-binding protein 114876 Oxysterol-binding protein 114879 Oxysterol-binding protein 114879 Oxysterol-binding protein 114880 Oxysterol-binding protein 114880 Oxysterol-binding protein 114881 Oxysterol-binding protein 114881 Oxysterol-binding protein 114882 Oxysterol-binding protein 114883 Oxysterol-binding protein 114883 Oxysterol-binding protein 114883 Oxysterol-binding protein 114883 Oxysterol-binding protein 114883 Oxysterol-binding protein 114883 Oxysterol-binding protein 114883 Oxysterol-binding protein 114884 Oxysterol-binding protein 114885 Oxysterol-binding protein 114886 Globin 114889 Homeobox domain 114893 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 114893 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 114895 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 114897 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 114899 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 114901 Sorbin homologous domain 114903 PMP-22/EMP/MP20/Claudin family 114906 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 114971 Protein-tyrosine phosphatase 114971 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 115004 Mab-21 protein 115004 Mab-21 protein 115019 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 115019 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 115131 7 transmembrane receptor (rhodopsin family) 115154 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 115196 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 115201 Peptidase family C54 115201 Peptidase family C54 115201 Peptidase family C54 115207 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 115273 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 115281 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 115286 Mitochondrial carrier protein 115290 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 115290 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 115294 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 115330 7 transmembrane receptor (rhodopsin family) 115361 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 115361 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 115362 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 115362 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 115426 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 115426 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 115426 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 115426 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 115426 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 115426 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 115442 Protein-tyrosine phosphatase 115509 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 115548 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 115584 Sodium:solute symporter family 115669 TPR Domain 115701 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 115727 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 115727 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 115727 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 115761 ADP-ribosylation factor family 115770 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 115817 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 115827 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 115908 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 115939 Domain of unknown function (DUF367) 115939 Possible metal-binding domain in RNase L inhibitor, RLI. Possible metal-binding domain in endoribonuclease RNase L inhibitor. Found at the N-terminal end of RNase L inhibitor proteins, adjacent to the 4Fe-4S binding domain, fer4, pfam00037. Also often fou 116068 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 116085 Sugar (and other) transporter 116085 Sugar (and other) transporter 116123 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 116166 Zinc-binding dehydrogenase 116179 Transglutaminase family 116179 Transglutaminase family, C-terminal ig like domain 116179 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 116225 MYND finger 116255 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 116328 Mechanosensitive ion channel. Two members of this protein family of M. jannaschii have been functionally characterized. Both proteins form mechanosensitive (MS) ion channels upon reconstitution into liposomes and functional examination by the patch-clamp 116362 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 116369 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 116369 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 116379 Tissue factor 116379 Tissue factor 116379 Tissue factor 116412 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 116444 Bacterial extracellular solute-binding proteins, family 3 116444 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 116448 Helix-loop-helix DNA-binding domain 116456 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 116463 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 116466 wnt family 116469 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 116470 Syntaxin 116474 Somatotropin hormone family 116481 HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins 116482 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 116484 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 116486 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 116486 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 116487 Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria 116491 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 116498 7 transmembrane receptor (rhodopsin family) 116499 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 116499 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling. 116502 Apoptosis regulator proteins, Bcl-2 family 116506 7 transmembrane receptor (Secretin family) 116509 Sodium:neurotransmitter symporter family 116510 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 116511 7 transmembrane receptor (rhodopsin family) 116519 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 116534 7 transmembrane receptor (rhodopsin family) 116543 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 116544 Pou domain - N-terminal to homeobox domain 116547 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 116548 Furin-like cysteine rich region 116548 Giardia variant-specific surface protein 116548 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic d 116548 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont 116549 Protein kinase domain 116551 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 116555 Nuclear pore protein 84 / 107. Nup84p forms a complex with five proteins, of which Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis 116556 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 116557 Fork head domain 116558 Interleukin 7/9 family. IL-7 is a cytokine that acts as a growth factor for early lymphoid cells of both B- and T-cell lineages. IL-9 is a multi-functional cytokine that, although originally described as a T-cell growth factor, its function in T-cell resp 116560 Inward rectifier potassium channel 116561 Clathrin light chain 116562 Interleukin 2 116563 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 116564 Protein kinase domain 116564 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 116568 Gamma-glutamyltranspeptidase 116569 Glycosyl hydrolase family 1 116589 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 116592 Trefoil (P-type) domain 116593 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 116598 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 116599 Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD 116600 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 116631 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation 116632 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation 116637 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 116639 Homeobox domain 116639 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 116642 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin 116643 Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o 116644 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 116646 Translationally controlled tumor protein 116647 Carboxylesterase 116648 Homeobox domain 116649 Inward rectifier potassium channel 116649 Inward rectifier potassium channel 116651 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 116653 CAP protein 116654 Sec8 exocyst complex component specific domain 116655 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 116658 Fibronectin type III domain 116663 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 116664 ATP synthase (F/14-kDa) subunit. This family includes 14-kDa subunit from vATPases, which is in the peripheral catalytic part of the complex. The family also includes archaebacterial ATP synthase subunit F 116665 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 116666 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19- 116668 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 116671 Cyclin-dependent kinase 5 activator protein 116672 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 116673 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 116676 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 116677 7 transmembrane receptor (rhodopsin family) 116681 Guanylate-kinase-associated protein (GKAP) protein 116683 Ephrin 116684 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 116685 Intermediate filament protein 116685 Intermediate filament tail domain 116689 SH2 domain 116690 Fibronectin type III domain 116691 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 116692 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 116693 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 116694 Calpain family cysteine protease 116694 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by 116696 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 116700 Glycoprotein hormone 116717 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 116719 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 116720 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 116723 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata 116724 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 116727 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 116727 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 116740 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 116742 Cadherin domain 116743 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 116744 7 transmembrane receptor (rhodopsin family) 116745 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 116748 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli 116777 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 116780 Cadherin domain 116782 Cadherin domain 116808 Cadherin domain 116831 ATP1G1/PLM/MAT8 family 116838 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 116839 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 116841 SNAP-25 family. SNAP-25 (synaptosome-associated protein 25 kDa) proteins are components of SNARE complexes. Members of this family contain a cluster of cysteine residues that can be palmitoylated for membrane attachment 116844 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 116848 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 116848 DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors) 116849 Interleukin 10 116852 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 116870 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 116870 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do 116871 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 116872 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 116873 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 116904 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 116905 Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein s 116914 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog 116939 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 116940 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 116967 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions 116969 NAD:arginine ADP-ribosyltransferase 116983 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 116984 PH domain. PH stands for pleckstrin homology 116985 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 116985 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 116986 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 116986 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 116987 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 116987 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 116987 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 116988 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 116996 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 116998 Pentaxin family. Pentaxins are also known as pentraxins 116999 Ribosomal protein L36e 116999 Zinc finger, C3HC4 type (RING finger) 116999 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 117001 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 117001 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 117004 7 transmembrane receptor (rhodopsin family) 117005 7 transmembrane receptor (rhodopsin family) 117016 Kinase associated domain 1 117018 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 117018 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 117019 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 117021 Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and protein 117023 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 117023 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 117024 Ornithine cyclodeaminase/mu-crystallin family. This family contains the bacterial Ornithine cyclodeaminase enzyme EC:4.3.1.12, which catalyses the deamination of ornithine to proline. This family also contains mu-Crystallin the major component of the eye 117025 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 117026 Phosphotyrosine interaction domain (PTB/PID) 117027 7 transmembrane receptor (rhodopsin family) 117028 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 117029 7 transmembrane receptor (rhodopsin family) 117033 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 117033 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 117033 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 117034 Helix-loop-helix DNA-binding domain 117035 Mitochondrial carrier protein 117037 Hyaluronidase 117042 Ribosomal protein L6e 117042 Ribosomal protein L6, N-terminal domain 117043 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 117045 Eukaryotic initiation factor 4E 117046 Intermediate filament protein 117048 Cadherin domain 117050 ADP-ribosylation factor family 117051 ADP-ribosylation factor family 117052 Inward rectifier potassium channel 117053 Ribosomal protein S8 117055 Connexin 117061 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 117061 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 117061 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 117065 Homeobox domain 117088 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e 117094 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 117095 Helix-loop-helix DNA-binding domain 117096 Carboxylesterase 117097 7 transmembrane receptor (rhodopsin family) 117098 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 117099 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 117100 Sodium:neurotransmitter symporter family 117101 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 117101 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 117103 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 117105 Kv2 voltage-gated K+ channel 117105 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 117105 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 117106 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 117107 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 117108 Glycosyltransferase family 43 117109 Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40 117140 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 117144 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 117148 Uncharacterized ACR, YneC family COG1359 117148 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ 117148 Uncharacterized ACR, YneC family COG1359 117152 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 117154 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 117155 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 117155 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 117155 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 117177 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio 117179 CUB domain 117180 'Cold-shock' DNA-binding domain 117182 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 117194 7 transmembrane receptor (rhodopsin family) 117195 7 transmembrane receptor (rhodopsin family) 117196 7 transmembrane receptor (rhodopsin family) 117198 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 117198 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 117229 Protein kinase domain 117229 Protein kinase domain 117232 Homeobox domain 117240 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 117241 Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40 117241 Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40 117242 Plant PEC family metallothionein 117248 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 117252 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 117257 7 transmembrane receptor (rhodopsin family) 117258 Fibronectin type III domain 117260 NUDIX domain 117261 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 117264 7 transmembrane receptor (rhodopsin family) 117267 Sulfate transporter family. Mutations may lead to several human diseases 117270 Trefoil (P-type) domain 117274 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 117276 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 117279 Death effector domain 117280 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 117281 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 117283 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 117287 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 117505 LIM domain. This family represents two copies of the LIM structural domain 117512 Low-density lipoprotein receptor domain class A 117512 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 117513 Synaptobrevin 117515 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 117519 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 117523 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 117526 Transcription factor TFIID (or TATA-binding protein, TBP) 117533 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 117537 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 117537 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 117540 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 117548 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 117548 Kinesin motor domain 117548 PH domain. PH stands for pleckstrin homology 117548 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 117549 7 transmembrane receptor (rhodopsin family) 117550 Kinesin motor domain 117550 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 117553 Repeat in ubiquitin-activating (UBA) protein 117553 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 117554 Lipase 117556 Sugar (and other) transporter 117557 Tropomyosin 117559 Sugar (and other) transporter 117581 Helix-loop-helix DNA-binding domain 117591 Sugar (and other) transporter 117592 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 117600 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 117600 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 117606 Fibronectin type III domain 117854 B-box zinc finger 117854 Zinc finger, C3HC4 type (RING finger) 117854 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 117955 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 118424 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 118427 Olfactomedin-like domain 118446 Connexin 118453 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 118453 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 118453 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 118453 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 118453 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 118453 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 118454 Connexin 118454 Connexin 118454 Connexin 118704 PH domain. PH stands for pleckstrin homology 118704 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 118704 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 118706 PH domain. PH stands for pleckstrin homology 118706 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 118706 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 118709 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 118856 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 118881 O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production 118933 ab-hydrolase associated lipase region 118933 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 118934 Intermediate filament protein 118934 ab-hydrolase associated lipase region 118934 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 118945 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 118980 Tricarboxylate carrier 118980 Tricarboxylate carrier 118987 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 118992 Homeobox domain 119016 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 119016 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 119180 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 119358 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 119391 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 119391 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 119548 Lipase 119548 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 119558 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 119558 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 119559 Tricarboxylate carrier 119559 Tricarboxylate carrier 119563 Ribosomal protein S8e 119587 Zinc carboxypeptidase 119587 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 119676 7 transmembrane receptor (rhodopsin family) 119678 7 transmembrane receptor (rhodopsin family) 119679 7 transmembrane receptor (rhodopsin family) 119682 7 transmembrane receptor (rhodopsin family) 119683 7 transmembrane receptor (rhodopsin family) 119684 7 transmembrane receptor (rhodopsin family) 119685 7 transmembrane receptor (rhodopsin family) 119686 7 transmembrane receptor (rhodopsin family) 119687 7 transmembrane receptor (rhodopsin family) 119689 7 transmembrane receptor (rhodopsin family) 119690 7 transmembrane receptor (rhodopsin family) 119691 7 transmembrane receptor (rhodopsin family) 119692 7 transmembrane receptor (rhodopsin family) 119694 7 transmembrane receptor (rhodopsin family) 119695 7 transmembrane receptor (rhodopsin family) 119696 7 transmembrane receptor (rhodopsin family) 119722 Intermediate filament protein 119749 7 transmembrane receptor (rhodopsin family) 119750 7 transmembrane receptor (rhodopsin family) 119764 7 transmembrane receptor (rhodopsin family) 119765 7 transmembrane receptor (rhodopsin family) 119766 Ribosomal protein L21e 119770 7 transmembrane receptor (rhodopsin family) 119772 7 transmembrane receptor (rhodopsin family) 119774 7 transmembrane receptor (rhodopsin family) 119775 7 transmembrane receptor (rhodopsin family) 120061 7 transmembrane receptor (rhodopsin family) 120062 7 transmembrane receptor (rhodopsin family) 120065 7 transmembrane receptor (rhodopsin family) 120066 7 transmembrane receptor (rhodopsin family) 120071 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 120082 jmjN domain 120082 jmjC domain 120084 jmjN domain 120084 jmjC domain 120088 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 120103 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 120105 Cadherin domain 120123 Zinc finger, C3HC4 type (RING finger) 120123 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 120126 HMG (high mobility group) box 120127 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 120128 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 120144 HMG (high mobility group) box 120145 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 120146 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 120227 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 120237 Homeobox domain 120318 Ribosomal protein S6e 120323 Ribosomal protein L21e 120332 ICE-like protease (caspase) p20 domain 120332 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 120356 Ribosomal protein S2 120364 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 120449 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 120526 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 120528 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 120586 7 transmembrane receptor (rhodopsin family) 120643 7 transmembrane receptor (rhodopsin family) 120718 14-3-3 protein 120775 7 transmembrane receptor (rhodopsin family) 120776 7 transmembrane receptor (rhodopsin family) 120787 7 transmembrane receptor (rhodopsin family) 120793 Intermediate filament protein 120793 7 transmembrane receptor (rhodopsin family) 120796 7 transmembrane receptor (rhodopsin family) 120824 B-box zinc finger 120824 Zinc finger, C3HC4 type (RING finger) 120829 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 120864 Dehydratase family 120864 Ribosomal family S4e 120872 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 120892 Protein kinase domain 120892 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 120914 Ribosomal protein L31e 120940 Lectin C-type domain. This family includes both long and short form C-type 120943 Aminotransferase class I and II 120950 Ribosomal protein S26e 121021 Laminin G domain 121054 Intermediate filament protein 121071 Zinc-binding dehydrogenase 121071 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 121078 Ribosomal protein L13 121129 7 transmembrane receptor (rhodopsin family) 121130 7 transmembrane receptor (rhodopsin family) 121155 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 121214 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 121216 Trefoil (P-type) domain 121216 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 121227 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 121260 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 121270 7 transmembrane receptor (rhodopsin family) 121275 7 transmembrane receptor (rhodopsin family) 121278 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 121340 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 121360 7 transmembrane receptor (rhodopsin family) 121364 7 transmembrane receptor (rhodopsin family) 121391 Intermediate filament protein 121456 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 121498 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 121498 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 121512 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 121519 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 121520 Aminotransferase class I and II 121549 Helix-loop-helix DNA-binding domain 121550 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 121551 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 121665 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 121734 Cyclophilin type peptidyl-prolyl cis-trans isomerase 121790 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 121790 pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinit 121906 Proteasome A-type and B-type 121916 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 121940 ADP-ribosylation factor family 121940 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 121981 Cyclophilin type peptidyl-prolyl cis-trans isomerase 122011 Protein kinase domain 122013 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 122013 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 122042 7 transmembrane receptor (rhodopsin family) 122055 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 122107 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 122128 Ribosomal L18ae protein family 122134 Protein kinase domain 122201 START domain 122217 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 122335 Cyclophilin type peptidyl-prolyl cis-trans isomerase 122402 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 122402 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 122500 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 122501 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 122524 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 122529 Intermediate filament protein 122553 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 122570 Ribosomal protein L6 122585 Ribosomal protein L23 122589 Ribosomal protein L10 122592 Intermediate filament protein 122622 Adenylosuccinate synthetase 122649 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 122651 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 122703 7 transmembrane receptor (rhodopsin family) 122706 Proteasome A-type and B-type 122718 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 122739 7 transmembrane receptor (rhodopsin family) 122740 7 transmembrane receptor (rhodopsin family) 122741 7 transmembrane receptor (rhodopsin family) 122742 7 transmembrane receptor (rhodopsin family) 122744 7 transmembrane receptor (rhodopsin family) 122745 7 transmembrane receptor (rhodopsin family) 122747 7 transmembrane receptor (rhodopsin family) 122748 7 transmembrane receptor (rhodopsin family) 122749 7 transmembrane receptor (rhodopsin family) 122750 7 transmembrane receptor (rhodopsin family) 122772 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 122783 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 122788 Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va 122799 S25 ribosomal protein 122830 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 122867 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 122873 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 122927 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 122950 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 122961 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 123031 Ribosomal protein L21e 123041 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 123041 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 123041 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 123096 Mitochondrial carrier protein 123099 Fatty acid desaturase 123103 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 123103 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 123146 Intermediate filament protein 123169 Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription 123228 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 123257 HMG14 and HMG17 123278 7 transmembrane receptor (rhodopsin family) 123279 7 transmembrane receptor (rhodopsin family) 123374 Core histone H2A/H2B/H3/H4 123396 Intermediate filament protein 123396 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 123397 Ribosomal L29e protein family 123442 Intermediate filament protein 123722 Fibronectin type III domain 123722 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 123745 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 123788 Trypsin 123876 AMP-binding enzyme 123879 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 123887 Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein 124132 Ribosomal protein S8 124149 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 124199 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 124199 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 124199 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 124221 Trypsin 124222 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 124274 7 transmembrane receptor (rhodopsin family) 124321 Eukaryotic ribosomal protein L18 124401 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 124401 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 124404 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 124411 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 124451 Homeobox domain 124451 LIM domain. This family represents two copies of the LIM structural domain 124454 tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyla 124460 PX domain. PX domains bind to phosphoinositides 124461 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 124497 Homeobox domain 124535 HSF-type DNA-binding 124538 7 transmembrane receptor (rhodopsin family) 124555 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 124563 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 124565 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 124602 Kinesin motor domain 124637 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 124641 Domain of unknown function (DUF341) 124663 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 124664 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 124667 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 124751 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 124760 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 124817 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 124827 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 124858 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 124872 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 124912 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme 124925 Sushi domain (SCR repeat) 124925 CUB domain 124925 Sushi domain (SCR repeat) 124936 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 124975 Gamma-glutamyltranspeptidase 124976 Sugar (and other) transporter 125058 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 125110 Cyclophilin type peptidyl-prolyl cis-trans isomerase 125111 Connexin 125113 Intermediate filament protein 125115 Intermediate filament protein 125115 Intermediate filament protein 125115 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 125182 7 transmembrane receptor (rhodopsin family) 125206 Sodium:solute symporter family 125228 Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown 125242 Intermediate filament protein 125306 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 125336 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 125338 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 125491 Cyclophilin type peptidyl-prolyl cis-trans isomerase 125491 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 125836 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 125850 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 125884 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 125910 Ribosomal protein S8 125950 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 125950 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 125958 7 transmembrane receptor (rhodopsin family) 125962 7 transmembrane receptor (rhodopsin family) 125963 7 transmembrane receptor (rhodopsin family) 125966 Tropomyosin 125972 Calreticulin family 125980 Adenylate kinase 126003 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 126005 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 126005 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 126006 LGN motif, putative GEF specific for G-alpha GTPase 126015 Homeobox domain 126037 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 126060 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 126070 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 126119 Josephin 126129 Choline/Carnitine o-acyltransferase 126147 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 126170 Cyclophilin type peptidyl-prolyl cis-trans isomerase 126204 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 126205 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 126206 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 126208 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 126235 Ribosomal family S4e 126242 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 126252 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 126295 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 126308 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 126341 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 126370 7 transmembrane receptor (rhodopsin family) 126374 LIM domain. This family represents two copies of the LIM structural domain 126393 Hsp20/alpha crystallin family 126410 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 126432 Vacuolar sorting protein 9 (VPS9) domain 126433 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 126433 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated 126438 7 transmembrane receptor (rhodopsin family) 126439 7 transmembrane receptor (rhodopsin family) 126441 7 transmembrane receptor (rhodopsin family) 126476 Intermediate filament protein 126494 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 126504 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 126520 Protein kinase domain 126537 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 126538 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 126541 7 transmembrane receptor (rhodopsin family) 126549 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 126567 C2 domain 126576 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic 126598 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 126611 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 126636 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 126637 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 126668 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 126669 SH2 domain 126755 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 126792 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 126811 Intermediate filament protein 126823 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 126849 Adenine deaminase. Adenine deaminase EC:3.5.4.2 hydrolyses adenine to form hypoxanthine and ammonia. The enzyme is part of a large metal dependent hydrolase superfamily. Adenine deaminases reaction is important for adenine utilization as a purine and also 126860 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 126860 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 126860 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 126862 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 126868 Mab-21 protein 126905 Sushi domain (SCR repeat) 126917 Intermediate filament protein 126963 Ribosomal protein L6e 126963 Ribosomal protein L6, N-terminal domain 126971 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 126971 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 127018 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 127059 7 transmembrane receptor (rhodopsin family) 127062 7 transmembrane receptor (rhodopsin family) 127063 7 transmembrane receptor (rhodopsin family) 127064 7 transmembrane receptor (rhodopsin family) 127065 7 transmembrane receptor (rhodopsin family) 127066 7 transmembrane receptor (rhodopsin family) 127068 7 transmembrane receptor (rhodopsin family) 127069 7 transmembrane receptor (rhodopsin family) 127071 7 transmembrane receptor (rhodopsin family) 127074 7 transmembrane receptor (rhodopsin family) 127077 7 transmembrane receptor (rhodopsin family) 127086 Ribosomal protein L10 127086 Ku70/Ku80 C-terminal arm. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the C terminal arm. Thi 127096 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 127099 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 127118 Ribosomal protein L23 127118 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 127118 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 127118 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 127124 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 127133 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 127148 7 transmembrane receptor (rhodopsin family) 127149 7 transmembrane receptor (rhodopsin family) 127150 Pentaxin family. Pentaxins are also known as pentraxins 127184 Polyprenyl synthetase 127223 Eukaryotic porin 127253 Uncharacterized ACR, COG1590 127257 Cyclophilin type peptidyl-prolyl cis-trans isomerase 127295 Ribosomal protein L36e 127300 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 127300 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 127324 Sugar (and other) transporter 127343 Homeobox domain 127343 Homeobox domain 127372 Ribosomal protein L15 127380 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 127385 7 transmembrane receptor (rhodopsin family) 127406 Ribosomal protein S2 127410 Cytochrome C oxidase subunit II, periplasmic domain 127534 Connexin 127540 HMG (high mobility group) box 127540 HMG (high mobility group) box 127545 Ribosomal L18ae protein family 127550 Glycosyltransferase family 6 127557 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 127572 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 127572 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 127572 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 127590 Ribosomal S3Ae family 127602 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 127608 7 transmembrane receptor (rhodopsin family) 127614 7 transmembrane receptor (rhodopsin family) 127617 7 transmembrane receptor (rhodopsin family) 127619 7 transmembrane receptor (rhodopsin family) 127621 7 transmembrane receptor (rhodopsin family) 127623 7 transmembrane receptor (rhodopsin family) 127707 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 127739 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 127739 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 127779 Poly-adenylate binding protein, unique domain 127784 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 127829 ADP-ribosylation factor family 127845 Got1-like family. Traffic through the yeast Golgi complex depends on a member of the syntaxin family of SNARE proteins, Sed5, present in early Golgi cisternae. Got1 is thought to facilitate Sed5-dependent fusion events 127850 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 127875 Ribosomal L29e protein family 127881 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 127884 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 127884 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 127887 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 127887 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 127933 Protein kinase domain 127933 Protein kinase domain 128066 Ribosomal protein S8 128100 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 128102 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 128103 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 128103 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 128136 Hydroxymethylglutaryl-coenzyme A synthase 128192 Cyclophilin type peptidyl-prolyl cis-trans isomerase 128209 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 128239 RasGAP C-terminus 128239 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 128240 YjeF-related protein N-terminus 128322 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 128337 7 transmembrane receptor (rhodopsin family) 128340 Glycosyl hydrolases family 18 128341 Glycosyl hydrolases family 18 128345 Glycosyl hydrolases family 18 128360 7 transmembrane receptor (rhodopsin family) 128361 7 transmembrane receptor (rhodopsin family) 128363 7 transmembrane receptor (rhodopsin family) 128365 7 transmembrane receptor (rhodopsin family) 128367 7 transmembrane receptor (rhodopsin family) 128368 7 transmembrane receptor (rhodopsin family) 128369 7 transmembrane receptor (rhodopsin family) 128370 7 transmembrane receptor (rhodopsin family) 128371 7 transmembrane receptor (rhodopsin family) 128372 7 transmembrane receptor (rhodopsin family) 128373 7 transmembrane receptor (rhodopsin family) 128387 TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme 128441 Domain of unknown function (DUF392). A eukaryotic specific domain of undetermined function.` 128488 WAP-type (Whey Acidic Protein) 'four-disulfide core' 128646 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 128665 7 transmembrane receptor (rhodopsin family) 128674 7 transmembrane receptor (rhodopsin family) 128762 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 128854 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 128872 HMG (high mobility group) box 129032 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 129049 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 129081 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 129097 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 129116 Aconitase family (aconitate hydratase) 129116 Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism 129186 Intermediate filament protein 129316 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 129374 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analogu 129402 'Cold-shock' DNA-binding domain 129426 Ribosomal protein L19e 129439 Actin 129481 Intermediate filament protein 129486 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 129499 HMG (high mobility group) box 129522 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 129522 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 129531 MIT domain 129560 S-adenosyl-L-homocysteine hydrolase 129560 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 129602 Phosphatidylinositol 3- and 4-kinase 129607 Thymidylate kinase 129684 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 129684 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 129804 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 129841 Ribosomal protein S2 129844 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 129868 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 129870 HMG (high mobility group) box 130023 S-adenosyl-L-homocysteine hydrolase 130023 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 130075 7 transmembrane receptor (rhodopsin family) 130120 Lectin C-type domain. This family includes both long and short form C-type 130195 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 130271 PH domain. PH stands for pleckstrin homology 130271 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 130271 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 130340 Clathrin adaptor complex small chain 130340 Clathrin adaptor complex small chain 130340 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 130367 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 130399 Protein kinase domain 130399 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 130429 Ribosomal protein S2 130430 Translationally controlled tumor protein 130450 Vinculin family 130502 TPR Domain 130505 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 130517 Sulfotransferase protein 130535 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 130589 Aldose 1-epimerase 130600 Ribosomal protein L14p/L23e 130633 Zinc finger, C3HC4 type (RING finger) 130633 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 130672 Ribosomal protein S19 130678 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 130728 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 130752 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 130773 Ribosomal protein L23 130888 F-box domain 130916 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 130916 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 130985 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 131034 C2 domain 131055 Cyclophilin type peptidyl-prolyl cis-trans isomerase 131096 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 131096 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 131117 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 131118 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 131149 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 131149 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 131185 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 131185 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 131284 7 transmembrane receptor (rhodopsin family) 131364 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 131375 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme 131377 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 131377 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 131405 B-box zinc finger 131405 Filamin/ABP280 repeat 131405 Ribosomal protein L23 131405 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph 131523 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 131566 LCCL domain 131566 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 131572 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 131691 Cyclophilin type peptidyl-prolyl cis-trans isomerase 131692 Cyclophilin type peptidyl-prolyl cis-trans isomerase 131692 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 131873 von Willebrand factor type A domain 131880 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 131889 Intermediate filament protein 131961 FMN-dependent dehydrogenase 131961 2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor 131961 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 131973 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 132048 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 132049 Polyribonucleotide nucleotidyltransferase, RNA binding domain. This family contains the RNA binding domain of Polyribonucleotide nucleotidyltransferase (PNPase) PNPase is involved in mRNA degradation in a 3'-5' direction 132049 3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 132049 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 132170 S25 ribosomal protein 132187 Protein kinase domain 132204 Synaptophysin / synaptoporin 132205 Intermediate filament protein 132243 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 132391 Intermediate filament protein 132430 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 132524 Peptidase family M49 132535 Ribosomal protein S8 132556 NAC domain 132562 Zinc-binding dehydrogenase 132602 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 132612 Adenosine-deaminase (editase) domain 132689 B-box zinc finger 132689 Zinc finger, C3HC4 type (RING finger) 132689 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 132704 Intermediate filament protein 132719 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 132719 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 132722 Trypsin 132724 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 132724 Trypsin 132724 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 132789 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 132798 UDP-glucoronosyl and UDP-glucosyl transferase 132864 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 132869 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 132869 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 132929 Intermediate filament protein 132946 ADP-ribosylation factor family 132949 AMP-binding enzyme 132949 Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix b 132954 Phosducin 133052 Intermediate filament protein 133060 Protein of unknown function, DUF270 133083 Insulinase (Peptidase family M16) 133121 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 133205 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 133214 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 133256 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 133277 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 133277 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 133299 Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in S. cerevisiae , Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In S. cerevisiae, the exocyst 133308 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 133320 Ribosomal protein S26e 133332 Ribosomal protein S5, C-terminal domain 133384 Transforming growth factor beta like domain 133418 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 133482 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 133482 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 133486 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 133486 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 133539 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 133569 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 133584 Fibronectin type III domain 133584 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 133584 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 133584 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 133609 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 133628 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 133629 Ribosomal protein L5 133629 ribosomal L5P family C-terminus. This region is found associated with pfam00281 133635 Ribosomal protein L44 133666 Ribosomal protein L19e 133686 ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus 133688 UDP-glucoronosyl and UDP-glucosyl transferase 133730 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 133744 Ribosomal S3Ae family 133789 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 133790 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 134082 7 transmembrane receptor (rhodopsin family) 134083 7 transmembrane receptor (rhodopsin family) 134111 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 134187 Homeobox domain 134265 PH domain. PH stands for pleckstrin homology 134337 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 134356 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 134357 Aconitase family (aconitate hydratase) 134357 Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism 134391 7 transmembrane receptor (rhodopsin family) 134412 Intermediate filament protein 134429 START domain 134430 Utp21 specific WD40 associated domain. Utp21 is a subunit of U3 snoRNP, which is essential for synthesis of 18S rRNA 134469 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 134478 Actin 134479 Intermediate filament protein 134492 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 134494 Tropomyosin 134502 PH domain. PH stands for pleckstrin homology 134502 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 134530 Cyclophilin type peptidyl-prolyl cis-trans isomerase 134579 Ribosomal protein S7e 134581 Intermediate filament protein 134611 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 134637 Cytidine and deoxycytidylate deaminase zinc-binding region 134725 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 134800 Stathmin family 134829 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 134829 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 134850 Ribosomal protein L21e 134860 7 transmembrane receptor (rhodopsin family) 134864 7 transmembrane receptor (rhodopsin family) 134870 Ribosomal protein L23 134897 SH2 domain 134897 Protein-tyrosine phosphatase 135112 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 135112 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 135114 HIT family 135152 Glycosyltransferase family 43 135228 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 135250 Class I Histocompatibility antigen, domains alpha 1 and 2 135257 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 135270 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 135293 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 135458 Hus1-like protein. Hus1, Rad1, and Rad9 are three evolutionarily conserved proteins required for checkpoint control in fission yeast. These proteins are known to form a stable complex in vivo. Hus1-Rad1-Rad9 complex may form a PCNA-like ring structure, a 135521 FKBP-type peptidyl-prolyl cis-trans isomerase 135522 Intermediate filament protein 135892 B-box zinc finger 135892 B-box zinc finger 135892 Zinc finger, C3HC4 type (RING finger) 135892 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 135896 7 transmembrane receptor (rhodopsin family) 135905 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 135924 7 transmembrane receptor (rhodopsin family) 135935 Homeobox domain 135935 Cyclophilin type peptidyl-prolyl cis-trans isomerase 135935 Herpesvirus transcription activation factor (transactivator). This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses 135941 7 transmembrane receptor (rhodopsin family) 135942 7 transmembrane receptor (rhodopsin family) 135943 7 transmembrane receptor (rhodopsin family) 135944 7 transmembrane receptor (rhodopsin family) 135946 7 transmembrane receptor (rhodopsin family) 135948 7 transmembrane receptor (rhodopsin family) 136046 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 136132 Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 136143 Eukaryotic ribosomal protein L18 136178 GDP dissociation inhibitor 136242 Trypsin 136248 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 136256 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 136306 Sugar (and other) transporter 136306 Sugar (and other) transporter 136307 FMN-dependent dehydrogenase 136307 2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor 136307 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 136321 Ribosomal L15 136337 Elongation factor 1 gamma, conserved domain 136371 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 136505 Ribosomal protein L21e 136992 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 137075 PMP-22/EMP/MP20/Claudin family 137107 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 137133 Aminotransferase class-V 137165 Ribosomal protein L3 137202 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t 137207 CUB domain 137520 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 137814 Homeobox domain 137839 Protein kinase domain 137885 Cyclophilin type peptidyl-prolyl cis-trans isomerase 137886 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 137902 Animal haem peroxidase 137902 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 137964 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 138031 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation o 138040 Intermediate filament protein 138115 S-adenosyl-L-homocysteine hydrolase 138151 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 138160 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 138198 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 138267 Putative snoRNA binding domain. This family consists of various Pre RNA processing ribonucleoproteins. The function of the aligned region is unknown however it may be a common RNA or snoRNA or Nop1p binding domain. Nop5p (Nop58p) from yeast is the protein 138412 Mitochondrial carrier protein 138414 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 138416 Intermediate filament protein 138428 Peptidyl-tRNA hydrolase 138453 Ribosomal protein S5, C-terminal domain 138453 Ribosomal protein S5, N-terminal domain 138512 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 138512 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 138562 14-3-3 protein 138604 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 138649 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 138652 Trypsin 138715 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 138721 Glutamine synthetase, beta-Grasp domain 138721 Glutamine synthetase, catalytic domain 138741 7 transmembrane receptor (rhodopsin family) 138742 7 transmembrane receptor (rhodopsin family) 138748 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 138799 7 transmembrane receptor (rhodopsin family) 138802 7 transmembrane receptor (rhodopsin family) 138803 7 transmembrane receptor (rhodopsin family) 138804 7 transmembrane receptor (rhodopsin family) 138805 7 transmembrane receptor (rhodopsin family) 138850 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 138855 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 138879 Intermediate filament protein 138881 7 transmembrane receptor (rhodopsin family) 138882 7 transmembrane receptor (rhodopsin family) 138883 7 transmembrane receptor (rhodopsin family) 138888 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 138924 Cyclophilin type peptidyl-prolyl cis-trans isomerase 138925 Giardia variant-specific surface protein 138926 Giardia variant-specific surface protein 138932 Microtubule associated protein (MAP65/ASE1 family) 138961 Nucleoside diphosphate kinase 138971 Peptidase family M49 139060 Intermediate filament protein 139065 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 139075 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 139076 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 139081 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 139081 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 139084 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 139115 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 139125 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 139170 WD domain, G-beta repeat 139171 Cytochrome C and Quinol oxidase polypeptide I 139171 NADH-ubiquinone/plastoquinone oxidoreductase chain 4L 139189 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 139189 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 139189 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 139193 HMG (high mobility group) box 139211 Intermediate filament protein 139216 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 139216 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 139226 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 139242 Ribosomal protein S17 139270 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 139270 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 139271 7 transmembrane receptor (rhodopsin family) 139272 HSF-type DNA-binding 139272 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 139275 eRF1 domain 1. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 139275 eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 139279 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 139281 Intermediate filament protein 139282 14-3-3 protein 139284 SH2 domain 139284 Phosphotyrosine interaction domain (PTB/PID) 139302 ENV polyprotein (coat polyprotein) 139304 Ribosomal protein L6e 139306 HMG (high mobility group) box 139309 Ku70/Ku80 C-terminal arm. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the C terminal arm. Thi 139309 Ku70/Ku80 N-terminal alpha/beta domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the amino 139309 Ku70/Ku80 beta-barrel domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the central DNA-bin 139318 Cyclophilin type peptidyl-prolyl cis-trans isomerase 139324 Homeobox domain 139334 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 139334 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 139340 Polyprenyl synthetase 139362 Intermediate filament protein 139363 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 139378 7 transmembrane receptor (Secretin family) 139385 Intermediate filament protein 139390 Cyclic nucleotide-binding domain 139390 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 139397 Ribosomal protein L3 139401 Translationally controlled tumor protein 139411 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 139422 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 139422 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 139423 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 139425 WD domain, G-beta repeat 139431 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 139433 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 139451 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 139452 Ribosomal protein L6e 139457 Translationally controlled tumor protein 139473 7 transmembrane receptor (rhodopsin family) 139497 ATP synthase subunit C 139542 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 139543 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 139543 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 139562 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 139576 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 139599 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 139604 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 139628 Fork head domain 139647 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 139649 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 139707 Fork head domain 139728 Protein kinase domain 139735 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 139735 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 139741 Actin 139748 Intermediate filament protein 139760 7 transmembrane receptor (rhodopsin family) 139760 7 transmembrane receptor (rhodopsin family) 139805 Ribosomal protein L36e 139808 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 139808 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 139835 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 139839 Intermediate filament protein 139839 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 139871 7 transmembrane receptor (rhodopsin family) 139880 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 139886 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 139927 Ribosomal protein S7e 139932 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 139952 Protein kinase domain 139952 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 139957 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellul 140004 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 140028 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 140036 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 140038 Trypsin 140065 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 140100 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 140101 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 140123 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 140129 'chromo' (CHRromatin Organization MOdifier) domain 140129 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 140149 'chromo' (CHRromatin Organization MOdifier) domain 140149 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 140164 PXA domain. This domain is associated with PX domains pfam00787 140338 Ribosomal protein L23 140338 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 140446 Intermediate filament protein 140447 Calx-beta domain 140447 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 140448 Calx-beta domain 140448 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 140453 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 140456 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 140458 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 140459 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 140459 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 140473 7 transmembrane receptor (rhodopsin family) 140474 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 140474 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 140477 Homeobox domain 140479 7 transmembrane receptor (rhodopsin family) 140481 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 140482 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 140489 Helix-loop-helix DNA-binding domain 140491 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 140491 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 140492 Calcium-activated SK potassium channel 140492 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 140493 Calcium-activated SK potassium channel 140493 pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha- 140494 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 140497 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 140498 7 transmembrane receptor (rhodopsin family) 140500 PH domain. PH stands for pleckstrin homology 140500 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 140500 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 140500 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 140547 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 140568 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 140571 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 140576 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 140577 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 140578 Lectin C-type domain. This family includes both long and short form C-type 140584 wnt family 140585 Protein kinase domain 140586 HMG (high mobility group) box 140592 Uncharacterized ACR, COG1579 140592 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 140592 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 140592 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 140592 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 140593 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 140594 C2 domain 140594 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 140595 7 transmembrane receptor (rhodopsin family) 140625 Actin 140638 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 140654 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 140654 Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer 140655 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 140661 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 140667 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 140670 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 140670 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 140674 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 140674 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 140675 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 140675 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 140679 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 140683 Coronavirus nucleocapsid protein 140693 PH domain. PH stands for pleckstrin homology 140694 Dynamin GTPase effector domain 140694 PH domain. PH stands for pleckstrin homology 140694 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 140700 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 140701 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 140721 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 140723 PMP-22/EMP/MP20/Claudin family 140724 PMP-22/EMP/MP20/Claudin family 140725 PMP-22/EMP/MP20/Claudin family 140726 PMP-22/EMP/MP20/Claudin family 140728 PMP-22/EMP/MP20/Claudin family 140729 PMP-22/EMP/MP20/Claudin family 140732 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 140734 Dynein light chain type 1 140735 Dynein light chain type 1 140766 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 140766 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 140766 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 140766 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 140781 Myosin head (motor domain) 140795 7 transmembrane receptor (rhodopsin family) 140801 Ribosomal L10 140803 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 140803 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 140807 Intermediate filament protein 140810 Protein kinase domain 140826 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 140836 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 140856 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 140860 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 140860 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 140866 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 140866 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 140868 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 140870 WAP-type (Whey Acidic Protein) 'four-disulfide core' 140894 Cyclic nucleotide-binding domain 140898 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 140902 SCP-like extracellular protein. This domain is also found in prokaryotes 140910 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. 140915 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 140915 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 140922 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 140926 Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined 140928 3'5'-cyclic nucleotide phosphodiesterase 140929 3'5'-cyclic nucleotide phosphodiesterase 140931 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 140932 Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway 140933 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 140935 PH domain. PH stands for pleckstrin homology 140939 B-box zinc finger 140941 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 140943 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 140944 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin 140945 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 140945 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 140946 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 140946 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 142679 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 142680 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 142681 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 142685 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 142688 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 142688 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 142689 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 142827 AMP-binding enzyme 142850 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 142891 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 142901 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 142910 ab-hydrolase associated lipase region 142980 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 143038 Enolase, C-terminal TIM barrel domain 143098 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 143125 Ribosomal protein L21e 143158 PH domain. PH stands for pleckstrin homology 143158 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 143158 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 143243 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 143244 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 143339 Homeobox domain 143383 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 143425 C2 domain 143496 7 transmembrane receptor (rhodopsin family) 143502 7 transmembrane receptor (rhodopsin family) 143503 7 transmembrane receptor (rhodopsin family) 143543 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 143548 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL 143689 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 143689 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 143712 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 143712 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 143828 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 143879 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 143879 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 143903 Lectin C-type domain. This family includes both long and short form C-type 143903 Lectin C-type domain. This family includes both long and short form C-type 144124 7 transmembrane receptor (rhodopsin family) 144124 7 transmembrane receptor (rhodopsin family) 144125 7 transmembrane receptor (rhodopsin family) 144132 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 144195 Sugar (and other) transporter 144247 Ribosomal protein S2 144253 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 144402 C2 domain 144404 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 144423 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 144448 Tetraspanin family 144453 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 144455 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 144483 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 144501 Intermediate filament protein 144501 Intermediate filament protein 144501 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 144568 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 144581 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 144715 Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage 144717 PH domain. PH stands for pleckstrin homology 144767 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 144983 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 145009 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 145047 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 145173 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 145226 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 145258 Homeobox domain 145264 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 145288 Ribosomal protein L11, RNA binding domain 145288 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 145371 ADP-ribosylation factor family 145371 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 145371 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 145389 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 145414 Ribosomal protein L3 145497 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 145501 AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges 145501 Thrombospondin type 1 domain 145501 AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges 145567 Coatomer WD associated domain 145581 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 145767 Ribosomal S3Ae family 145790 C2 domain 145799 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 145800 Dynamin GTPase effector domain 145814 Pyroglutamyl peptidase 145990 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 146053 Ribosomal S3Ae family 146057 Protein kinase domain 146057 Intermediate filament protein 146110 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 146223 Synaptogyrin. This family of proteins is distantly related to pfam01284 146223 Synaptogyrin. This family of proteins is distantly related to pfam01284 146223 Synaptogyrin. This family of proteins is distantly related to pfam01284 146279 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 146310 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result 146347 Ribosomal protein S2 146395 PMP-22/EMP/MP20/Claudin family 146434 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 146456 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 146485 Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vps 146488 Sodium:neurotransmitter symporter family 146603 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 146656 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 146691 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 146754 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 146818 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 147015 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 147040 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 147072 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 147079 Ribosomal protein S26e 147111 Pectinacetylesterase 147111 Pectinacetylesterase 147166 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 147183 Intermediate filament protein 147183 Intermediate filament protein 147199 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 147228 Intermediate filament protein 147323 START domain 147346 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 147358 Ribosomal S3Ae family 147374 Glutamate/Leucine/Phenylalanine/Valine dehydrogenase 147381 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 147398 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 147400 Helper component proteinase. This protein is found in genome polyproteins of potyviruses 147400 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 147407 Mitochondrial carrier protein 147409 Cadherin domain 147409 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 147655 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 147658 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 147664 ENV polyprotein (coat polyprotein) 147686 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 147687 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 147694 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 147699 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 147740 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 147740 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 147742 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 147804 Tropomyosin 147807 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 147808 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 147837 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 147923 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 147980 Ribosomal protein L21e 148003 Galactoside-binding lectin 148064 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 148066 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 148066 Zinc finger, C3HC4 type (RING finger) 148066 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 148103 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 148203 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 148206 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 148254 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 148276 7 transmembrane receptor (rhodopsin family) 148281 C2 domain 148330 Ribosomal protein S2 148356 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 148362 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 148370 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 148430 Ribosomal protein S5, C-terminal domain 148430 Ribosomal protein S5, N-terminal domain 148446 Reeler domain 148479 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 148549 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 148640 Ribosomal protein S19 148709 Actin 148713 Protein-tyrosine phosphatase 148789 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 148854 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 148864 Homeobox domain 148864 LIM domain. This family represents two copies of the LIM structural domain 148932 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 148947 Ribosomal protein S2 148953 Ribosomal protein L21e 148955 Transcription initiation factor IIF, beta subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIF (TFIIF) is a tetramer of two beta subun 148960 Ribosomal protein S5, C-terminal domain 148960 Ribosomal protein S5, N-terminal domain 148995 Ribosomal protein S7e 149041 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 149050 Aminotransferase class-V 149086 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 149111 Cornichon protein 149224 Ribosomal protein S7e 149318 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 149329 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 149360 Ribosomal protein L24e 149461 PMP-22/EMP/MP20/Claudin family 149476 Ribosomal protein S8 149501 Intermediate filament protein 149501 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 149620 Glycosyl hydrolases family 18 149620 Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse 149628 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 149767 HMG (high mobility group) box 150000 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 150000 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 150160 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 150200 Ubiquitin carboxyl-terminal hydrolase family 2 150207 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 150209 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 150212 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 150221 Fibronectin type III domain 150221 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 150280 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 150287 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 150356 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 150379 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 150400 Ribosomal L29 protein 150445 Phosphatidylethanolamine-binding protein 150465 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 150472 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 150483 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 150498 14-3-3 protein 150519 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 150572 MYND finger 150572 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 150580 HMG14 and HMG17 150610 Phosphorylase family 2 150681 7 transmembrane receptor (rhodopsin family) 150739 Intermediate filament protein 150739 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 150771 Mab-21 protein 150771 Mab-21 protein 150821 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 150868 Ribosomal protein L11, RNA binding domain 150868 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 150940 WD domain, G-beta repeat 150948 PMP-22/EMP/MP20/Claudin family 150969 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 150978 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 150984 Eukaryotic porin 151011 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 151011 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 151056 Lipase/Acylhydrolase with GDSL-like motif 151094 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 151155 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 151195 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 151234 Sulfotransferase protein 151258 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 151295 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic 151313 Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the tyr 151320 HMG (high mobility group) box 151437 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 151449 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 151449 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 151451 Transposase family tnp2 151457 TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae 151470 Tropomyosin 151516 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 151561 Core histone H2A/H2B/H3/H4 151579 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 151723 Homeobox domain 151741 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 151790 WD domain, G-beta repeat 151825 Intermediate filament protein 151835 C2 domain 151956 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 151963 Mab-21 protein 151963 Mab-21 protein 151973 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 152002 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 152015 Regulatory subunit of type II PKA R-subunit 152059 CAP protein 152098 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 152101 Homeobox domain 152160 Ribosomal protein L21e 152330 Fibronectin type III domain 152330 Fibronectin type III domain 152330 Fibronectin type III domain 152503 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 152504 Ribosomal S3Ae family 152559 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 152579 Sec1 family 152682 NAC domain 152829 WD domain, G-beta repeat 152831 Glycosyl hydrolase family 1 152890 Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold 152905 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 152922 Nucleotide-sensitive chloride conductance regulator (ICln) 152972 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 153027 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 153129 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 153201 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 153217 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 153218 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 153328 Mitochondrial carrier protein 153364 Metallo-beta-lactamase superfamily 153393 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 153561 Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 153565 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 153642 Sulfatase 153817 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 153886 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 154062 Nitroreductase family. Members of this family utilise FMN as a cofactor 154064 Class I Histocompatibility antigen, domains alpha 1 and 2 154091 Sugar (and other) transporter 154094 Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown 154150 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 154165 Ribosomal protein L21e 154337 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 154337 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 154516 UDP-glucoronosyl and UDP-glucosyl transferase 154545 Ribosomal S17 154562 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 154661 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 154849 14-3-3 protein 154865 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 154865 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 154866 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 154881 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 154982 Protein kinase domain 154985 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 154995 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 155012 Sushi domain (SCR repeat) 155051 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 155054 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 155100 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 155205 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 155268 PMP-22/EMP/MP20/Claudin family 155423 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 155696 LCCL domain 155696 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 155812 Fibronectin type III domain 155812 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 156726 Runt domain 156873 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 157285 Protein kinase domain 157295 Mammalian defensin 157295 Defensin propeptide 157310 Phosphatidylethanolamine-binding protein 157317 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 157506 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 157543 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 157552 Ribosomal protein S2 157567 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 157574 F-box domain 157589 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 157589 Pentapeptide repeats (8 copies). These repeats are found in many cyanobacterial proteins. The repeats were first identified in hglK. The function of these repeats is unknown. The structure of this repeat has been predicted to be a beta-helix. The repeat c 157589 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 157667 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 157708 Signal peptidase I 157777 MCM2/3/5 family 157784 Tropomyosin 157793 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 157807 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 157807 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 157848 Homeobox domain 158056 MAM domain. An extracellular domain found in many receptors 158062 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 158067 Adenylate kinase 158104 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 158125 WD domain, G-beta repeat 158131 7 transmembrane receptor (rhodopsin family) 158135 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 158160 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 158160 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 158200 Ribosomal protein S26e 158204 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 158234 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 158237 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 158237 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 158284 Protein kinase domain 158345 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 158352 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 158381 A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation 158399 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 158399 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 158452 Cyclin-dependent kinase regulatory subunit 158452 Arginosuccinate synthase. This family contains a PP-loop motif 158473 Ribosomal protein S2 158624 FMN-dependent dehydrogenase 158624 2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor 158624 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 158624 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 158656 Ribosomal protein S2 158668 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 158678 Ribosomal protein S2 158747 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 158773 Ribosomal protein S5, C-terminal domain 158773 Ribosomal protein S5, N-terminal domain 158777 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 158800 Homeobox domain 158809 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 158833 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 158835 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 158880 Ubiquitin carboxyl-terminal hydrolase family 2 158880 Ubiquitin carboxyl-terminal hydrolases family 2 158943 RanBP1 domain 158983 Core histone H2A/H2B/H3/H4 158997 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 159119 HSF-type DNA-binding 159163 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 159184 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 159296 Homeobox domain 159371 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 159686 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 159815 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 159963 Sodium:solute symporter family 160382 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 160418 TPR Domain 160428 pfam02911, formyl_trans_C, Formyl transferase, C-terminal domain 160428 Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate defor 160428 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 160492 Intermediate filament tail domain 160518 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 160518 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 160518 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 160518 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 160518 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 160622 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 160851 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 160851 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 161003 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 161253 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 161291 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo 161356 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 161357 MAM domain. An extracellular domain found in many receptors 161452 HMG (high mobility group) box 161476 Hsp20/alpha crystallin family 161477 Ribosomal protein S15 161717 Dynamin GTPase effector domain 161823 Adenosine/AMP deaminase 162073 Mab-21 protein 162282 Fibronectin type III domain 162466 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 162494 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 162514 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 162540 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 162540 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 162605 Intermediate filament protein 162605 Intermediate filament protein 162655 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 162835 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 162952 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 162962 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 162968 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 162993 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 162998 7 transmembrane receptor (rhodopsin family) 163049 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 163050 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 163051 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 163059 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 163071 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 163175 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 163201 7 transmembrane receptor (rhodopsin family) 163233 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 163351 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 163404 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 163486 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 163486 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 163486 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 163589 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 163742 Mitochondrial carrier protein 163742 Mitochondrial carrier protein 163782 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 163786 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 164022 Cyclophilin type peptidyl-prolyl cis-trans isomerase 164022 Cyclophilin type peptidyl-prolyl cis-trans isomerase 164036 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 164036 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 164045 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 164091 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h 164091 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 164153 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 164395 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 164633 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 164668 Cytidine and deoxycytidylate deaminase zinc-binding region 164668 Cytidine and deoxycytidylate deaminase zinc-binding region 164781 WD domain, G-beta repeat 164832 Zinc finger, C3HC4 type (RING finger) 164832 ATP-dependent protease La (LON) domain 165082 7 transmembrane receptor (Secretin family) 165140 7 transmembrane receptor (rhodopsin family) 165257 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 165324 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 165324 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 165829 7 transmembrane receptor (metabotropic glutamate family) 166012 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 166336 LIM domain. This family represents two copies of the LIM structural domain 166348 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 166348 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 166379 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 166496 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 166522 SH2 domain 166647 7 transmembrane receptor (Secretin family) 166647 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 166647 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 166657 B-box zinc finger 166657 Zinc finger, C3HC4 type (RING finger) 166657 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 166752 Calx-beta domain 166752 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 166793 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 166815 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 166815 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 166968 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 166974 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 166975 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 167127 UDP-glucoronosyl and UDP-glucosyl transferase 167227 NUDIX domain 167681 Trypsin 167826 Helix-loop-helix DNA-binding domain 168283 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 168391 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 168433 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 168474 AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d 168507 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 168507 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 168507 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 168507 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c 168620 Helix-loop-helix DNA-binding domain 168667 von Willebrand factor type D domain 168667 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 168975 Cyclic nucleotide-binding domain 169044 von Willebrand factor type A domain 169044 Thrombospondin N-terminal -like domain 169166 PX domain. PX domains bind to phosphoinositides 169355 Indoleamine 2,3-dioxygenase 169436 Protein kinase domain 169505 Shikimate kinase 169522 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 169522 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 169611 Olfactomedin-like domain 169611 Olfactomedin-like domain 169714 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 169831 PX domain. PX domains bind to phosphoinositides 169841 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 169981 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 170106 Actin 170302 Homeobox domain 170384 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 170392 Zona pellucida-like domain 170439 GNS1/SUR4 family 170441 Sugar (and other) transporter 170442 Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1 170460 START domain 170461 START domain 170465 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 170483 Bacterial extracellular solute-binding proteins, family 3 170483 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 170484 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 170496 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 170501 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 170520 Globin 170538 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 170544 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 170567 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 170568 Zona pellucida-like domain 170571 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 170571 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 170572 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 170572 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 170573 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 170574 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 170575 GTPase of unknown function 170577 Kinase associated domain 1 170587 Citrate synthase 170591 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 170625 PX domain. PX domains bind to phosphoinositides 170633 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 170634 7 transmembrane receptor (metabotropic glutamate family) 170634 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 170635 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 170639 7 transmembrane receptor (rhodopsin family) 170641 Galactose binding lectin domain 170641 7 transmembrane receptor (Secretin family) 170641 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 170641 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou 170643 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 170667 ATP synthase subunit C 170668 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 170669 Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This 170670 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 170672 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 170672 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 170673 Paralemmin 170676 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 170677 Cadherin domain 170685 NUDIX domain 170689 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 170689 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 170690 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 170690 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 170691 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 170691 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 170692 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 170692 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 170698 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 170698 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 170700 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling 170704 7 transmembrane receptor (rhodopsin family) 170714 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn 170715 Poly-adenylate binding protein, unique domain 170716 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 170719 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 170720 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 170721 Thrombospondin type 1 domain 170721 Thrombospondin type 1 domain 170721 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 170721 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 170722 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl 170722 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle 170725 Calpain family cysteine protease 170725 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 170729 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 170732 7 transmembrane receptor (rhodopsin family) 170733 Lectin C-type domain. This family includes both long and short form C-type 170735 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 170735 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 170736 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 170738 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170738 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170738 Cyclic nucleotide-binding domain 170738 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170739 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170743 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 170744 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 170752 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc 170755 PX domain. PX domains bind to phosphoinositides 170755 PX domain. PX domains bind to phosphoinositides 170756 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 170757 7 transmembrane receptor (Secretin family) 170757 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 170760 Acyl CoA binding protein 170760 Acyl CoA binding protein 170762 Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and protein 170768 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 170768 6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta 170774 Caveolin 170776 Lectin C-type domain. This family includes both long and short form C-type 170779 Lectin C-type domain. This family includes both long and short form C-type 170780 Lectin C-type domain. This family includes both long and short form C-type 170787 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 170808 Calpain family cysteine protease 170808 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 170813 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 170816 7 transmembrane receptor (rhodopsin family) 170825 Homeobox domain 170829 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 170842 BTG1 family. A novel family of anti-proliferative proteins 170847 Inward rectifier potassium channel 170848 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 170848 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170849 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 170850 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 170850 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170850 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 170850 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170895 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 170898 Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1 170899 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170899 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 170903 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 170905 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 170907 TMS membrane protein/tumour differentially expressed protein (TDE) 170908 Protein kinase domain 170909 Ets-domain 170909 Sterile alpha motif (SAM)/Pointed domain 170911 Phosphatidylinositol 3- and 4-kinase 170911 PI3-kinase family, p85-binding domain 170911 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 170911 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 170911 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 170914 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 170917 FAD binding domain of DNA photolyase 170920 Protein kinase domain 170920 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 170921 Integrin alpha cytoplasmic region. This family contains the short intracellular region of integrin alpha chains 170922 Protein kinase domain 170924 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 170924 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 170925 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 170925 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 170926 7 transmembrane receptor (rhodopsin family) 170928 Uncharacterized ACR, YneC family COG1359 170929 Apoptosis regulator proteins, Bcl-2 family 170936 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 170936 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 170936 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 170938 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 170943 Mitochondrial carrier protein 170945 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 170945 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 170956 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 170959 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 170961 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 171009 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 171011 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 171011 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 171015 Oxidoreductase FAD-binding domain 171015 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 171016 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 171019 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 171019 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 171024 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 171026 Protein kinase A anchor 171039 C2 domain 171039 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 171044 7 transmembrane receptor (rhodopsin family) 171045 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 171045 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 171045 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase 171046 Helix-loop-helix DNA-binding domain 171048 Tetraspanin family 171049 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 171050 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 171052 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 171052 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 171052 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 171053 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 171054 7 transmembrane receptor (rhodopsin family) 171056 7 transmembrane receptor (rhodopsin family) 171057 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 171058 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 171062 PMP-22/EMP/MP20/Claudin family 171065 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 171068 Homeobox domain 171068 Pou domain - N-terminal to homeobox domain 171072 Sulfotransferase protein 171073 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 171074 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 171078 Helix-loop-helix DNA-binding domain 171079 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 171082 ATP synthase subunit C 171084 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 171085 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic 171085 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 171087 Uncharacterised protein family (UPF0041) 171090 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 171091 Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this re 171092 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 171097 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 171104 7 transmembrane receptor (rhodopsin family) 171105 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 171106 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 171108 7 transmembrane receptor (rhodopsin family) 171109 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 171114 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 171115 3'5'-cyclic nucleotide phosphodiesterase 171118 Carboxylesterase 171119 Fibronectin type III domain 171123 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 171126 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 171131 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 171131 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 171133 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl grou 171134 Galactoside-binding lectin 171135 Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi 171136 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 171136 CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co 171136 CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser 171136 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 171136 CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily con 171136 CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserv 171138 Slow voltage-gated potassium channel 171142 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 171143 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 171144 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 171144 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 171146 Cyclic nucleotide-binding domain 171146 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 171146 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 171147 Sugar (and other) transporter 171151 Mitochondrial carrier protein 171152 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tra 171154 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 171155 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 171162 Lectin C-type domain. This family includes both long and short form C-type 171163 Sodium:neurotransmitter symporter family 171164 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 171164 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 171166 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 171167 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 171171 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 171180 C2 domain 171183 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171184 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171185 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171186 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171187 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171188 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171189 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171190 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171191 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171192 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171193 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171194 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171195 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171195 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171196 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171197 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171198 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171199 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171200 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171201 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171202 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171203 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171205 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171206 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171207 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 171224 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171225 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171226 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171227 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171228 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171229 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171230 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171231 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171232 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171233 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171234 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171235 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171236 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171237 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171238 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171239 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171240 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171241 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171242 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171243 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171244 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171245 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171246 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171247 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171248 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171250 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171251 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171252 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171253 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171254 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171255 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171256 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171257 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171258 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171259 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171260 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171261 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171262 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171263 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171264 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171265 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171266 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171267 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171268 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171269 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171270 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171271 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171272 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171273 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171274 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171275 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171277 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171279 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 171287 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 171291 Kinesin motor domain 171293 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi 171296 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 171297 Carboxylesterase 171299 Fork head domain 171333 Fibronectin type III domain 171341 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 171350 Poly-adenylate binding protein, unique domain 171352 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 171355 Homeobox domain 171355 Pou domain - N-terminal to homeobox domain 171357 Eukaryotic-type carbonic anhydrase 171357 Protein-tyrosine phosphatase 171360 Homeobox domain 171362 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 171362 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 171365 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 171366 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 171372 Sugar (and other) transporter 171374 ATP synthase alpha/beta chain, C terminal domain 171374 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 171374 pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 171380 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 171381 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 171382 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 171384 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 171387 Zinc finger, C3HC4 type (RING finger) 171396 Sulfatase 171397 Sugar (and other) transporter 171398 Sugar (and other) transporter 171399 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr 171400 GNS1/SUR4 family 171402 GNS1/SUR4 family 171406 Somatotropin hormone family 171408 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 171415 Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 171415 Autophagocytosis associated protein, N-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 171429 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 171429 Sulfate transporter family. Mutations may lead to several human diseases 171432 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 171440 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 171441 Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1 171442 Sugar (and other) transporter 171447 Galactose binding lectin domain 171447 7 transmembrane receptor (Secretin family) 171447 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 171447 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou 171449 Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein 171454 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 171455 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 171458 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 171459 Zona pellucida-like domain 171469 7 transmembrane receptor (rhodopsin family) 171469 7 transmembrane receptor (rhodopsin family) 171493 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 171494 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 171494 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 171495 C2 domain 171495 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b 171497 Protein kinase domain 171497 Protein kinase C terminal domain 171506 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 171516 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 171517 Glypican 171518 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 171521 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 171522 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 171528 Trypsin 171529 Kinesin motor domain 171531 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 171551 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 171552 Prp18 domain. The splicing factor Prp18 is required for the second step of pre-mRNA splicing. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known. This fragment is fully active in yeast splicing in vitro and includes the sequ 171553 Glycosyltransferase family 6 171554 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12 171556 Somatotropin hormone family 171562 Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER 171564 Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal 171565 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has 171571 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 171572 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 171574 Dynamin family 171574 Dynamin GTPase effector domain 171574 PH domain. PH stands for pleckstrin homology 171575 Deoxyribonuclease II 171577 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a 171580 LIM domain. This family represents two copies of the LIM structural domain 171580 FAD binding domain. This domain is involved in FAD binding in a number of enzymes 171580 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 191569 3'5'-cyclic nucleotide phosphodiesterase 191570 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 191571 Cadherin domain 191571 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 191574 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 191575 Nucleoside diphosphate kinase 191576 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 191578 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 191578 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 192117 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 192119 Domain of unknown function. This putative domain is found in members of the Dicer protein family. This protein is a dsRNA nuclease that is involved in RNAi and related processes. This domain of about 100 amino acids has no known function, but does contai 192119 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 192119 RNase3 domain 192119 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 192119 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 192119 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 192119 Domain of unknown function. This putative domain is found in members of the Dicer protein family. This protein is a dsRNA nuclease that is involved in RNAi and related processes. This domain of about 100 amino acids has no known function, but does contain 192134 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 192153 7 transmembrane receptor (rhodopsin family) 192154 I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin 192155 Surfactant protein B 192155 Saposin A-type domain 192167 Carboxylesterase 192176 Filamin/ABP280 repeat 192176 Filamin/ABP280 repeat 192177 C2 domain 192179 Myelin proteolipid protein (PLP or lipophilin) 192185 ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus 192188 Extracellular link domain 192188 Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria 192189 C2 domain 192190 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 192192 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 192193 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 192195 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 192195 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 192195 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 192196 Protein of unknown function, DUF259 192198 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 192201 WAP-type (Whey Acidic Protein) 'four-disulfide core' 192203 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 192207 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 192208 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 192215 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 192218 MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases 192223 7 transmembrane receptor (rhodopsin family) 192224 Somatotropin hormone family 192226 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p 192234 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 192235 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 192241 ATP synthase delta (OSCP) subunit. The ATP D subunit from E. coli is the same as the OSCP subunit which is this family. The ATP D subunit from metazoa are found in family pfam00401 192242 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 192245 Hsp20/alpha crystallin family 192246 Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs 192248 Cadherin domain 192255 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 192257 Carboxylesterase 192258 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 192258 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 192258 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 192259 Lipase/Acylhydrolase with GDSL-like motif 192260 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 192262 CUB domain 192262 Sushi domain (SCR repeat) 192267 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 192269 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 192271 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 192271 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 192273 Sugar (and other) transporter 192275 DAD family. Members of this family are thought to be integral membrane proteins. Some members of this family have been shown to cause apoptosis if mutated, these proteins are known as DAD for defender against death. The family also includes the epsilon s 192277 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 192280 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 192285 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 192287 Mitochondrial carrier protein 192289 Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1 192348 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 192351 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 192360 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes 192363 Phosphotyrosine interaction domain (PTB/PID) 192645 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 192646 7 transmembrane receptor (rhodopsin family) 192648 7 transmembrane receptor (rhodopsin family) 192649 7 transmembrane receptor (rhodopsin family) 192654 Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol 192656 Protein kinase domain 192656 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 192657 Occludin/ELL family 192662 RHO protein GDP dissociation inhibitor 192666 Intermediate filament protein 192669 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 192669 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 192669 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 192669 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 192670 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 192670 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 192678 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 192683 SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c 192690 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 192786 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 192797 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 192837 Ribosomal protein L21e 192882 Guanylate kinase 192882 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 192882 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 192882 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 192885 ENV polyprotein (coat polyprotein) 192895 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 192897 Calx-beta domain 192897 Fibronectin type III domain 192897 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 192897 Integrins, beta chain. Sequences cut off at repeats due to overlap with EGF 192912 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 192940 Ribosomal protein L13e 192950 NAC domain 192970 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 192981 Eukaryotic ribosomal protein L18 193025 Ribosomal L22e protein family 193028 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 193053 7 transmembrane receptor (rhodopsin family) 193055 7 transmembrane receptor (rhodopsin family) 193058 7 transmembrane receptor (rhodopsin family) 193063 7 transmembrane receptor (rhodopsin family) 193073 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 193077 7 transmembrane receptor (rhodopsin family) 193079 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 193079 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 193098 ENV polyprotein (coat polyprotein) 193113 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 193113 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 193128 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 193137 7 transmembrane receptor (rhodopsin family) 193138 7 transmembrane receptor (rhodopsin family) 193139 7 transmembrane receptor (rhodopsin family) 193140 7 transmembrane receptor (rhodopsin family) 193147 7 transmembrane receptor (rhodopsin family) 193182 7 transmembrane receptor (rhodopsin family) 193190 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 193223 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 193237 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 193255 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 193296 Protein kinase domain 193301 Translation initiation factor SUI1 193312 Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain i 193325 L1 transposable element 193328 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 193330 Eukaryotic initiation factor 1A 193360 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193361 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193383 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 193403 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 193408 Core histone H2A/H2B/H3/H4 193418 Core histone H2A/H2B/H3/H4 193431 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 193433 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 193438 Triosephosphate isomerase 193445 Core histone H2A/H2B/H3/H4 193449 Core histone H2A/H2B/H3/H4 193449 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 193450 Core histone H2A/H2B/H3/H4 193450 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 193451 Ribosomal protein L21e 193454 Core histone H2A/H2B/H3/H4 193455 Core histone H2A/H2B/H3/H4 193455 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 193456 7 transmembrane receptor (rhodopsin family) 193456 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 193459 7 transmembrane receptor (rhodopsin family) 193459 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 193461 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 193474 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 193475 7 transmembrane receptor (rhodopsin family) 193475 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 193475 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 193476 7 transmembrane receptor (rhodopsin family) 193476 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 193476 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 193476 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 193476 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 193477 ENV polyprotein (coat polyprotein) 193477 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 193478 Competence-damaged protein. CinA is the first gene in the competence-inducible (cin) operon, and is thought to be specifically required at some stage in the process of transformation. This Pfam family consists of putative competence-damaged proteins from 193478 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 193481 7 transmembrane receptor (rhodopsin family) 193481 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 193482 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 193488 Somatotropin hormone family 193510 Core histone H2A/H2B/H3/H4 193510 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 193522 ENV polyprotein (coat polyprotein) 193522 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 193522 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 193525 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 193533 Proteasome A-type and B-type 193534 7 transmembrane receptor (rhodopsin family) 193543 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193546 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193556 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193564 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193565 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193574 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193576 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193582 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193587 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193588 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193589 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193590 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193678 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 193690 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 193690 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 193695 Protein kinase domain 193707 Ribosomal protein S5, N-terminal domain 193707 Ribosomal protein S5, C-terminal domain 193736 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 193736 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 193740 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 193742 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 193742 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 193742 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 193747 L1 transposable element 193747 7 transmembrane receptor (rhodopsin family) 193753 7 transmembrane receptor (rhodopsin family) 193762 Class I Histocompatibility antigen, domains alpha 1 and 2 193764 Class I Histocompatibility antigen, domains alpha 1 and 2 193765 Class I Histocompatibility antigen, domains alpha 1 and 2 193765 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 193766 Class I Histocompatibility antigen, domains alpha 1 and 2 193768 Zinc finger, C3HC4 type (RING finger) 193772 7 transmembrane receptor (rhodopsin family) 193773 7 transmembrane receptor (rhodopsin family) 193774 7 transmembrane receptor (rhodopsin family) 193782 7 transmembrane receptor (rhodopsin family) 193784 7 transmembrane receptor (rhodopsin family) 193785 7 transmembrane receptor (rhodopsin family) 193787 7 transmembrane receptor (rhodopsin family) 193788 7 transmembrane receptor (rhodopsin family) 193796 jmjC domain 193796 jmjN domain 193798 Ribosomal L29e protein family 193810 Protein kinase domain 193811 7 transmembrane receptor (rhodopsin family) 193821 7 transmembrane receptor (rhodopsin family) 193822 7 transmembrane receptor (rhodopsin family) 193824 7 transmembrane receptor (rhodopsin family) 193825 7 transmembrane receptor (rhodopsin family) 193827 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 193840 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 193840 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 193840 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 193840 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 193845 L1 transposable element 193845 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 193853 L1 transposable element 193887 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic 193921 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 193993 L1 transposable element 194003 C2 domain 194003 Type IV secretion system CagX conjugation protein 194003 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 194003 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 194003 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 194022 Ribosomal protein L6 194056 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 194103 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 194103 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 194119 Ribosomal protein S16 194157 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 194175 Protein kinase domain 194188 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 194189 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 194189 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 194189 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 194197 Ribosomal protein L5 194197 ribosomal L5P family C-terminus. This region is found associated with pfam00281 194198 Ribosomal protein S7e 194229 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 194229 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 194237 Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosy 194240 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 194245 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 194245 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 194253 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 194253 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 194253 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 194261 7 transmembrane receptor (rhodopsin family) 194269 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 194269 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 194269 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 194299 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 194299 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 194299 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 194350 L1 transposable element 194352 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 194352 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 194357 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 194357 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 194360 Trypsin 194401 FAD binding domain. This domain is involved in FAD binding in a number of enzymes 194401 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 194419 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 194419 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 194431 7 transmembrane receptor (rhodopsin family) 194432 7 transmembrane receptor (rhodopsin family) 194433 7 transmembrane receptor (rhodopsin family) 194436 L1 transposable element 194440 L1 transposable element 194446 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 194456 Sulfotransferase protein 194456 Sulfotransferase protein 194481 Ribosomal protein S5, C-terminal domain 194481 Ribosomal protein S5, N-terminal domain 194507 GH3 auxin-responsive promoter 194507 Repeat in ubiquitin-activating (UBA) protein 194507 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 194508 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 194508 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 194508 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 194538 7 transmembrane receptor (rhodopsin family) 194539 7 transmembrane receptor (rhodopsin family) 194540 7 transmembrane receptor (rhodopsin family) 194578 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 194580 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 194586 Sulfotransferase protein 194587 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 194588 Homeobox domain 194597 Trypsin 194597 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 194604 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 194612 7 transmembrane receptor (metabotropic glutamate family) 194612 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 194633 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 194633 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 194633 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 194634 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 194634 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 194634 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 194642 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 194655 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 194655 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 194664 7 transmembrane receptor (rhodopsin family) 194673 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 194673 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 194694 L1 transposable element 194694 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length 194700 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 194735 BTG1 family. A novel family of anti-proliferative proteins 194736 BTG1 family. A novel family of anti-proliferative proteins 194737 Homeobox domain 194738 Homeobox domain 194739 Homeobox domain 194740 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length 194744 Mitochondrial carrier protein 194765 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 194776 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 194788 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 194788 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 194810 ENV polyprotein (coat polyprotein) 194812 Actin 194825 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 194826 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 194837 Actin 194840 Hsp90 protein 194843 Proteasome A-type and B-type 194856 Homeobox domain 194861 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 194861 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 194861 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 194861 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 194861 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 194862 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 194862 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 194862 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 194862 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 194862 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 194862 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 194862 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 194862 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 194868 L1 transposable element 194868 Repeat in ubiquitin-activating (UBA) protein 194868 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 194884 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 194884 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 194884 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 194884 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 194884 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 194887 Ribosomal protein S21e 194895 Protein-tyrosine phosphatase 194895 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 194905 Ribosomal protein L6 194910 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 194913 Ribosomal protein S5, C-terminal domain 194913 Ribosomal protein S5, N-terminal domain 194952 jmjC domain 194952 jmjC domain 194957 ENV polyprotein (coat polyprotein) 194960 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 194985 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 194985 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 195014 Ribosomal protein L36e 195018 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 195042 Ribosomal protein L35Ae 195046 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 195071 Core histone H2A/H2B/H3/H4 195084 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 195149 L1 transposable element 195149 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 195150 Eukaryotic initiation factor 1A 195151 Eukaryotic initiation factor 1A 195152 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 195154 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 195155 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 195155 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 195155 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 195155 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 195156 L1 transposable element 195156 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 195156 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 195172 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195176 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195177 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195180 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195183 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195190 Cyclophilin type peptidyl-prolyl cis-trans isomerase 195208 Doublecortin 195211 Core histone H2A/H2B/H3/H4 195212 Core histone H2A/H2B/H3/H4 195212 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 195213 Core histone H2A/H2B/H3/H4 195221 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 195221 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 195221 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 195228 Core histone H2A/H2B/H3/H4 195229 Core histone H2A/H2B/H3/H4 195230 Core histone H2A/H2B/H3/H4 195230 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 195237 Core histone H2A/H2B/H3/H4 195240 ENV polyprotein (coat polyprotein) 195240 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 195240 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 195240 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 195252 HMG (high mobility group) box 195262 Core histone H2A/H2B/H3/H4 195266 Core histone H2A/H2B/H3/H4 195267 Core histone H2A/H2B/H3/H4 195267 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 195281 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195284 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195285 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195286 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195289 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195291 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195292 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195295 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195296 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195297 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195298 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195300 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195322 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 195322 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 195324 ENV polyprotein (coat polyprotein) 195324 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 195324 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 195324 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 195324 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 195333 Homeobox domain 195333 Homeobox domain 195349 L1 transposable element 195349 Uncharacterized ACR, COG1579 195349 Poly-adenylate binding protein, unique domain 195349 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 195349 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 195355 7 transmembrane receptor (rhodopsin family) 195356 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 195356 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 195359 Zinc finger, C3HC4 type (RING finger) 195363 7 transmembrane receptor (rhodopsin family) 195370 Ribosomal L29e protein family 195372 7 transmembrane receptor (rhodopsin family) 195401 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 195487 Ribosomal L10 195509 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 195509 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 195514 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 195522 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 195531 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 195533 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 195534 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 195559 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 195562 Peptidase family C54 195569 7 transmembrane receptor (rhodopsin family) 195576 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 195601 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195608 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195623 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 195623 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 195624 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 195646 Sulfotransferase protein 195671 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 195678 Translationally controlled tumor protein 195685 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 195686 ENV polyprotein (coat polyprotein) 195686 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 195686 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 195686 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 195686 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 195686 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 195686 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 195691 Sugar (and other) transporter 195691 HMG (high mobility group) box 195691 7 transmembrane receptor (metabotropic glutamate family) 195691 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 195701 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 195701 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 195707 ENV polyprotein (coat polyprotein) 195707 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195709 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195710 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 195712 RNA polymerase Rpb4 195726 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 195745 L1 transposable element 195746 L1 transposable element 195746 ENV polyprotein (coat polyprotein) 195751 L1 transposable element 195760 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 195760 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 195772 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 195772 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 195775 Renal dipeptidase 195783 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 195805 Homeobox domain 195805 CLN3 protein. This is a family of proteins from the CLN3 gene. A missense mutation of glutamic acid (E) to lysine (K) at position 295 in the human protein has been implicated in Juvenile neuronal ceroid lipofuscinosis (Batten disease) 195809 L1 transposable element 195809 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 195814 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 195977 von Willebrand factor type A domain 195979 PH domain. PH stands for pleckstrin homology 195979 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 195979 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 196051 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 196114 7 transmembrane receptor (rhodopsin family) 196116 7 transmembrane receptor (rhodopsin family) 196117 7 transmembrane receptor (rhodopsin family) 196188 Ribosomal protein S5, C-terminal domain 196188 Ribosomal protein S5, N-terminal domain 196200 7 transmembrane receptor (rhodopsin family) 196226 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 196266 Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic d 196294 Signal peptidase I 196335 7 transmembrane receptor (rhodopsin family) 196338 7 transmembrane receptor (rhodopsin family) 196346 HMG (high mobility group) box 196374 Intermediate filament protein 196376 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 196385 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 196400 C2 domain 196403 Zinc finger, C3HC4 type (RING finger) 196743 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 196883 Adenylate and Guanylate cyclase catalytic domain 196988 Dynamin GTPase effector domain 196996 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 197131 Putative zinc finger in N-recognin 197209 Eukaryotic-type carbonic anhydrase 197279 Ribosomal L10 197320 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 197350 ICE-like protease (caspase) p10 domain 197350 ICE-like protease (caspase) p20 domain 197358 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 199692 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 199699 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb 199709 Ribosomal S3Ae family 199718 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 199744 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 199746 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 199783 Trypsin 199799 S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea 199813 Sodium:neurotransmitter symporter family 199858 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 199873 Ribosomal protein L23 199974 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 199974 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 200008 Oxidoreductase FAD-binding domain 200008 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 200010 Sodium:solute symporter family 200014 C2 domain 200104 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 200132 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 200225 C2 domain 200226 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 200312 Zinc finger, C3HC4 type (RING finger) 200316 Cytidine and deoxycytidylate deaminase zinc-binding region 200316 Cytidine and deoxycytidylate deaminase zinc-binding region 200350 Fork head domain 200388 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 200391 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 200466 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 200504 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 200504 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 200539 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 200575 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 200580 Ribosomal protein L6e 200583 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 200722 Cytochrome c oxidase subunit III 200722 NADH-ubiquinone/plastoquinone oxidoreductase chain 4L 200734 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 200765 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 200765 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 200810 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 200845 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 200879 Lipase 200894 ADP-ribosylation factor family 200894 ADP-ribosylation factor family 200895 Dihydrofolate reductase 200909 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 200909 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 200909 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 200916 Ribosomal L22e protein family 200931 Domain of unknown function 200933 F-box domain 200933 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 200942 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 200959 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 200959 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 200972 ADP-ribosylation factor family 200972 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 201097 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 201140 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 201175 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 201176 PH domain. PH stands for pleckstrin homology 201176 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 201259 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 201294 C2 domain 201305 Sugar (and other) transporter 201382 Laminin N-terminal (Domain VI) 201382 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 201412 Cyclophilin type peptidyl-prolyl cis-trans isomerase 201475 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 201510 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 201516 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 201540 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 201595 Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase 201595 Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase 201625 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 201625 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 201654 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 201779 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 201784 Disintegrin 201784 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 201784 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 201798 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 201798 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 201829 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 201845 UDP-glucoronosyl and UDP-glucosyl transferase 201846 UDP-glucoronosyl and UDP-glucosyl transferase 201859 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 201878 Ribosomal protein L35Ae 201929 B-box zinc finger 201929 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 201989 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 201989 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 202083 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 202108 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 202108 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 202108 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 202122 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 202146 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 202158 Cytochrome C and Quinol oxidase polypeptide I 202227 Cyclophilin type peptidyl-prolyl cis-trans isomerase 202306 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 202319 Cyclophilin type peptidyl-prolyl cis-trans isomerase 202333 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 202366 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 202366 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 202500 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 202500 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 202538 Cytochrome C and Quinol oxidase polypeptide I 202757 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 202758 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 202758 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 202759 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 202789 pfam02921, UCR_TM, Ubiquinol cytochrome reductase transmembrane region. Each subunit of the cytochrome bc1 complex provides a single helix (this family) to make up the transmembrane region of the complex 202861 7 transmembrane receptor (rhodopsin family) 202915 Domain of unknown function 203010 NUDIX domain 203064 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran 203068 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 203068 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 203068 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 203068 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 203068 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 203074 Trypsin 203082 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 203102 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 203102 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 203112 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 203190 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 203190 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 203202 Translationally controlled tumor protein 203249 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 203261 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 203326 7 transmembrane receptor (rhodopsin family) 203427 Mitochondrial carrier protein 203429 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 203429 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 203447 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 203510 HMG (high mobility group) box 203611 'chromo' (CHRromatin Organization MOdifier) domain 203611 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 203806 Fibronectin type III domain 203806 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 203818 Intermediate filament protein 204010 Ribosomal protein S2 204219 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 204219 Homeobox domain 204219 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 204275 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 204474 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 204801 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 205501 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 206358 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 207063 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 207119 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 207120 NUDIX domain 207120 NUDIX domain 207121 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 207121 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 207123 Uncharacterised protein family (UPF0183). This family of proteins includes Lin-10 from C. elegans 207126 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 207150 NifU-like N terminal domain. This domain is found in NifU in combination with pfam01106. This domain is found on isolated in several bacterial species. The nif genes are responsible for nitrogen fixation. However this domain is found in bacteria that do n 207151 Sugar (and other) transporter 207152 7 transmembrane receptor (rhodopsin family) 207154 Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown 207155 7 transmembrane receptor (rhodopsin family) 207162 7 transmembrane receptor (rhodopsin family) 207165 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 207165 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 207177 Ribosomal protein L35Ae 207181 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 207187 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 207188 Actin 207194 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 207194 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 207194 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 207212 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 207213 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 207213 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 207217 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 207217 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 207217 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 207235 Translation initiation factor SUI1 207252 Eukaryotic initiation factor 4E 207259 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 207278 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 207278 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 207308 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 207318 Actin 207339 ENV polyprotein (coat polyprotein) 207350 von Willebrand factor type A domain 207357 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 207365 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 207371 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 207385 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 207390 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 207393 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 207408 Sulfatase 207415 Lipoate synthase. Lipoate synthase (or lipoic acid synthetase) catalyses the formation of alpha-(+)-lipoic acid, required for lipoate biosynthesis 207424 HMG (high mobility group) box 207474 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 207474 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 207492 Actin 207497 Clathrin adaptor complex small chain 207497 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 207544 Actin 207557 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 207558 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 207559 Lyase 207593 NAC domain 207595 Ribosomal protein S5, N-terminal domain 207595 Ribosomal protein S5, C-terminal domain 207602 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 207620 Helix-loop-helix DNA-binding domain 207624 HMG (high mobility group) box 207627 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 207629 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 207664 Cyclophilin type peptidyl-prolyl cis-trans isomerase 207667 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 207678 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 207695 Ribosomal protein S5, C-terminal domain 207695 Ribosomal protein S5, N-terminal domain 207704 GTP1/OBG family 207704 GTPase of unknown function 207704 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 207704 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 207728 3'5'-cyclic nucleotide phosphodiesterase 207728 GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases 207731 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 207745 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 207757 Ribosomal protein L6, N-terminal domain 207787 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 207790 7 transmembrane receptor (rhodopsin family) 207795 Alkaline phosphatase 207800 7 transmembrane receptor (rhodopsin family) 207806 Helix-loop-helix DNA-binding domain 207809 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 207810 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 207810 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 207839 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 207853 Tropomyosin 207867 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 207874 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 207885 Ribosomal protein L21e 207908 Hsp90 protein 207908 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 207911 7 transmembrane receptor (rhodopsin family) 207912 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 207913 ab-hydrolase associated lipase region 207913 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 207914 ab-hydrolase associated lipase region 207920 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 207923 Ribosomal protein L6e 207923 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 207924 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti 207950 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 207952 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 207952 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 207952 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 207958 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 208000 7 transmembrane receptor (rhodopsin family) 208006 7 transmembrane receptor (rhodopsin family) 208025 7 transmembrane receptor (rhodopsin family) 208030 7 transmembrane receptor (rhodopsin family) 208042 7 transmembrane receptor (rhodopsin family) 208043 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 208043 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 208044 7 transmembrane receptor (rhodopsin family) 208045 7 transmembrane receptor (rhodopsin family) 208079 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 208079 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 208080 Actin 208089 HMG (high mobility group) box 208109 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 208109 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 208110 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 208110 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 208118 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 208118 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 208119 Ribosomal protein L3 208144 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 208144 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 208154 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 208166 TRAF-type zinc finger 208166 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 208166 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 208166 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 208166 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 208166 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 208169 Cyclic nucleotide-binding domain 208169 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 208188 7 transmembrane receptor (rhodopsin family) 208188 7 transmembrane receptor (rhodopsin family) 208211 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 208226 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 208231 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 208231 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 208256 Alkaline phosphatase 208264 ENV polyprotein (coat polyprotein) 208266 Intermediate filament protein 208268 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess 208269 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 208269 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 208285 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 208291 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 208292 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 208331 Cyclophilin type peptidyl-prolyl cis-trans isomerase 208409 Core histone H2A/H2B/H3/H4 208416 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 208416 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 208416 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 208419 Protein kinase domain 208424 7 transmembrane receptor (rhodopsin family) 208427 C2 domain 208428 Ribosomal protein L19e 208433 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 208439 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 208440 AMP-binding enzyme 208510 Ribosomal protein S6e 208514 ENV polyprotein (coat polyprotein) 208521 TSC-22/dip/bun family 208522 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 208524 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 208543 Ribosomal protein S6e 208584 Homeobox domain 208584 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 208595 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 208618 Actin interacting protein 3 208634 Tetraspanin family 208635 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 208638 Mitochondrial carrier protein 208641 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 208645 7 transmembrane receptor (rhodopsin family) 208648 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 208650 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 208652 HMG (high mobility group) box 208665 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 208669 Hantavirus glycoprotein G2. The medium (M) genome segment of hantaviruses (family Bunyaviridae) encodes the two virion glycoproteins. G1 and G2, as a precursor protein in the complementary sense RNA 208674 Homeobox domain 208682 7 transmembrane receptor (rhodopsin family) 208691 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 208691 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 208715 Hydroxymethylglutaryl-coenzyme A synthase 208718 RNB-like protein. The function of this region of similarity is uncertain 208727 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 208731 7 transmembrane receptor (rhodopsin family) 208744 7 transmembrane receptor (rhodopsin family) 208748 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 208751 7 transmembrane receptor (rhodopsin family) 208757 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 208760 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 208777 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 208782 7 transmembrane receptor (rhodopsin family) 208795 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 208824 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 208859 Phosphatidylethanolamine-binding protein 208866 Ribosomal protein S2 208883 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 208894 ADP-ribosylation factor family 208895 ab-hydrolase associated lipase region 208895 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 208896 Ribosomal protein L36e 208919 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 208922 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 208927 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 208934 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 208936 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 208943 DIL domain. The DIL domain has no known function 208946 Translation initiation factor SUI1 208953 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 208967 Shikimate kinase 208967 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 208967 Pyridoxal-phosphate dependent enzyme. Members of this family are all pyridoxal-phosphate dependent enzymes. This family includes: serine dehydratase EC:4.2.1.13 P20132, threonine dehydratase EC:4.2.1.16, tryptophan synthase beta chain EC:4.2.1.20, threoni 208976 ENV polyprotein (coat polyprotein) 208982 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 208982 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 208999 Ribosomal S17 209003 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 209027 Delta 1-pyrroline-5-carboxylate reductase 209031 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 209033 7 transmembrane receptor (rhodopsin family) 209036 7 transmembrane receptor (rhodopsin family) 209040 7 transmembrane receptor (rhodopsin family) 209043 7 transmembrane receptor (rhodopsin family) 209055 Protein kinase domain 209057 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 209088 Ribosomal protein S5, N-terminal domain 209091 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 209091 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 209098 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 209109 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 209109 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 209120 HMG (high mobility group) box 209131 PX domain. PX domains bind to phosphoinositides 209155 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 209176 Indoleamine 2,3-dioxygenase 209183 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 209200 Zinc finger, C3HC4 type (RING finger) 209202 ENV polyprotein (coat polyprotein) 209202 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 209202 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 209202 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 209203 Hsp90 protein 209230 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 209239 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 209265 Tropomyosin 209268 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 209268 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 209275 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 209281 F-actin capping protein alpha subunit 209294 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 209294 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 209296 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 209317 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 209324 Orbivirus VP3 (T2) protein. The orbivirus VP3 protein is part of the virus core and makes a 'subcore' shell made up of 120 copies of the 100K protein. VP3 particles can also bind RNA and are fundamental in the early stages of viral core formation. Also fo 209324 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 209324 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 209326 Orbivirus VP3 (T2) protein. The orbivirus VP3 protein is part of the virus core and makes a 'subcore' shell made up of 120 copies of the 100K protein. VP3 particles can also bind RNA and are fundamental in the early stages of viral core formation. Also fo 209326 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 209326 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 209334 XPG I-region 209334 XPG I-region 209334 XPG N-terminal domain 209351 WAP-type (Whey Acidic Protein) 'four-disulfide core' 209351 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 209354 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 209357 Transcription factor Tfb4 209357 Transcription factor Tfb4 209357 Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex 209361 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 209371 Kinesin motor domain 209384 B-box zinc finger 209384 Zinc finger, C3HC4 type (RING finger) 209384 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 209387 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 209416 KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG 209416 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 209423 Ribosomal protein L19e 209442 Ribosomal L29e protein family 209444 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 209444 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 209446 Helix-loop-helix DNA-binding domain 209448 Homeobox domain 209449 7 transmembrane receptor (rhodopsin family) 209450 7 transmembrane receptor (rhodopsin family) 209456 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 209456 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 209464 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 209474 Cyclophilin type peptidyl-prolyl cis-trans isomerase 209488 SH2 domain 209501 Ribosomal protein L23 209508 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 209510 7 transmembrane receptor (rhodopsin family) 209511 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 209512 7 transmembrane receptor (rhodopsin family) 209513 7 transmembrane receptor (rhodopsin family) 209517 7 transmembrane receptor (rhodopsin family) 209524 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 209524 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 209540 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 209541 Actin 209542 mRNA capping enzyme, C-terminal domain 209542 mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme 209542 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 209558 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 209584 Uncharacterized ACR, COG1590 209586 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 209586 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 209589 Ribosomal protein S7e 209591 Acyl CoA binding protein 209607 Protein kinase domain 209630 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 209632 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 209632 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 209633 Ribosomal L22e protein family 209636 Ribosomal protein L21e 209637 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 209654 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 209674 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 209685 Ribosomal protein L19e 209692 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 209692 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 209714 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 209714 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 209737 Kinesin motor domain 209737 Kinesin motor domain 209742 von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA 209749 7 transmembrane receptor (rhodopsin family) 209749 7 transmembrane receptor (rhodopsin family) 209751 7 transmembrane receptor (rhodopsin family) 209753 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 209754 7 transmembrane receptor (rhodopsin family) 209761 7 transmembrane receptor (rhodopsin family) 209762 Class I Histocompatibility antigen, domains alpha 1 and 2 209764 7 transmembrane receptor (rhodopsin family) 209766 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 209768 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 209770 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 209772 Thymidine kinase 209773 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 209773 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 209773 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 209776 7 transmembrane receptor (rhodopsin family) 209782 7 transmembrane receptor (rhodopsin family) 209783 7 transmembrane receptor (rhodopsin family) 209786 7 transmembrane receptor (metabotropic glutamate family) 209787 7 transmembrane receptor (rhodopsin family) 209791 MCM2/3/5 family 209793 7 transmembrane receptor (rhodopsin family) 209794 ABC1 family. This family includes ABC1 from yeast and AarF from E. coli. These proteins have a nuclear or mitochondrial subcellular location in eukaryotes. The exact molecular functions of these proteins is not clear, however yeast ABC1 suppresses a cytoc 209807 7 transmembrane receptor (rhodopsin family) 209808 7 transmembrane receptor (rhodopsin family) 209824 7 transmembrane receptor (rhodopsin family) 209824 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 209824 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 209824 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 209837 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 209857 ENV polyprotein (coat polyprotein) 209861 PMP-22/EMP/MP20/Claudin family 209865 Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars 209871 Ribosomal S3Ae family 209887 ENV polyprotein (coat polyprotein) 209888 Vitamin B12 dependent methionine synthase, activation domain 209901 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 209917 HMG (high mobility group) box 209926 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 209988 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 210004 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 210004 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 210009 Flavodoxin 210030 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 210030 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 210044 Adenylate and Guanylate cyclase catalytic domain 210044 ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum 210045 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 210064 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 210066 Hsp90 protein 210094 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 210103 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 210103 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 210103 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 210104 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 210105 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 210105 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 210106 PAP/25A associated domain 210106 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 210126 LIM domain. This family represents two copies of the LIM structural domain 210129 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 210131 NADH:flavin oxidoreductase / NADH oxidase family 210145 GTPase of unknown function 210145 Protein of unknown function, DUF258 210145 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 210147 Phospholipase A2 inhibitor 210157 Ribosomal protein S6e 210162 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 210162 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 210172 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 210191 ENV polyprotein (coat polyprotein) 210197 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 210198 7 transmembrane receptor (metabotropic glutamate family) 210198 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 210212 ENV polyprotein (coat polyprotein) 210240 ENV polyprotein (coat polyprotein) 210240 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 210240 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 210240 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 210243 Eukaryotic ribosomal protein L18 210243 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 210265 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 210271 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 210271 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 210274 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 210282 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 210291 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 210297 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 210321 NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins 210351 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 210358 Homeobox domain 210364 Ribosomal protein L23 210381 Protein kinase domain 210389 Aminotransferase class I and II 210411 7 transmembrane receptor (metabotropic glutamate family) 210411 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 210413 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 210413 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 210413 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 210424 Thrombospondin type 1 domain 210429 Core histone H2A/H2B/H3/H4 210457 LIM domain. This family represents two copies of the LIM structural domain 210463 Sugar (and other) transporter 210465 Homeobox domain 210465 Sugar (and other) transporter 210465 7 transmembrane receptor (metabotropic glutamate family) 210465 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 210467 7 transmembrane receptor (metabotropic glutamate family) 210475 ENV polyprotein (coat polyprotein) 210477 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 210482 Ribosomal protein S5, C-terminal domain 210482 Ribosomal protein S5, N-terminal domain 210497 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 210497 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 210503 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 210507 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 210507 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 210510 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 210520 Glycosyl hydrolase family 1 210534 Ribosomal protein L21e 210554 Hus1-like protein. Hus1, Rad1, and Rad9 are three evolutionarily conserved proteins required for checkpoint control in fission yeast. These proteins are known to form a stable complex in vivo. Hus1-Rad1-Rad9 complex may form a PCNA-like ring structure, a 210564 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 210582 Uncharacterised protein family (UPF0083) 210588 Ribosomal protein S12 210599 Cytochrome C oxidase subunit II, periplasmic domain 210602 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 210602 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 210619 Protein kinase domain 210622 CUB domain 210622 Trypsin 210622 CUB domain 210622 Sushi domain (SCR repeat) 210668 Ribosomal protein S24e 210670 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 210708 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 210708 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 210709 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved 210714 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 210722 Ribosomal protein S2 210742 L1 transposable element 210758 7 transmembrane receptor (rhodopsin family) 210758 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 210760 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 210772 Sec34-like family. Sec34 and Sec35 form a sub-complex, in a seven protein complex that includes Dor1 (PFAM:PF04124). This complex is thought to be important for tether vesicles to the Golgi 210775 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 210779 Ribosomal protein L21e 210784 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 210827 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 210853 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 210853 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 210869 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 210876 7 transmembrane receptor (metabotropic glutamate family) 210876 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 210905 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 210933 7 transmembrane receptor (Secretin family) 210933 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 210933 7 transmembrane receptor (Secretin family) 210933 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 210933 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 210964 TPR Domain 210973 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 210976 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 210988 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 210994 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 211015 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 211027 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 211036 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 211105 Profilin 211153 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 211187 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 211191 Hsp90 protein 211208 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 211208 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 211223 7 transmembrane receptor (metabotropic glutamate family) 211223 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 211232 C2 domain 211232 C2 domain 211247 Ribosomal protein S8e 211255 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 211255 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 211266 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 211280 Hsp90 protein 211296 Fructose-bisphosphate aldolase class-I 211300 Trypsin 211303 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 211303 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 211305 F-box domain 211305 F-box domain 211321 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 211321 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 211321 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 211331 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 211332 Kinesin motor domain 211347 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 211362 7 transmembrane receptor (metabotropic glutamate family) 211362 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 211370 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 211389 Mo-co oxidoreductase dimerisation domain. This domain is found in molybdopterin cofactor (Mo-co) oxidoreductases. It is involved in dimer formation, and has an Ig-fold structure 211389 Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity 211389 Mo-co oxidoreductase dimerisation domain. This domain is found in molybdopterin cofactor (Mo-co) oxidoreductases. It is involved in dimer formation, and has an Ig-fold structure 211389 Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity 211389 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 211391 Ribosomal protein L11, RNA binding domain 211398 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 211398 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 211398 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 211421 Ribosomal protein S19 211430 C2 domain 211437 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 211437 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 211441 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 211457 7 transmembrane receptor (rhodopsin family) 211468 Cyclic nucleotide-binding domain 211468 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 211468 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 211469 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 211472 7 transmembrane receptor (rhodopsin family) 211480 Inward rectifier potassium channel 211484 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 211484 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 211490 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 211496 Protein kinase domain 211502 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 211539 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 211550 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 211551 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 211551 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 211552 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 211552 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 211563 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 211565 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 211566 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 211566 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 211572 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 211572 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 211572 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 211572 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 211577 7 transmembrane receptor (rhodopsin family) 211578 7 transmembrane receptor (rhodopsin family) 211586 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 211586 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 211587 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 211587 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 211598 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 211603 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 211603 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 211603 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 211612 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 211614 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 211622 Polyphosphate kinase. Polyphosphate kinase (Ppk) catalyzes the formation of polyphosphate from ATP, with chain lengths of up to a thousand or more orthophosphate molecules 211652 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 211666 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 211666 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 211669 Protein kinase domain 211669 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 211673 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 211685 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 211700 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 211700 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 211702 Zinc finger, C3HC4 type (RING finger) 211705 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 211706 TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme 211712 Cadherin domain 211725 Glutamine synthetase, catalytic domain 211726 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 211727 Defensin propeptide 211746 Ribosomal protein L21e 211757 Transforming growth factor beta like domain 211757 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 211776 Ribosomal S17 211795 7 transmembrane receptor (rhodopsin family) 211804 Phosphoglycerate mutase family 211832 NAD-dependent glycerol-3-phosphate dehydrogenase 211838 START domain 211857 Pou domain - N-terminal to homeobox domain 211870 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 211895 HMG (high mobility group) box 211903 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 211914 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 211914 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 211924 Cadherin domain 211924 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 211924 Cadherin domain 211924 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 211925 Ribosomal protein S5, N-terminal domain 211925 Ribosomal protein S5, C-terminal domain 211945 PH domain. PH stands for pleckstrin homology 211945 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 211948 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 211970 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 211973 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about 211980 7 transmembrane receptor (metabotropic glutamate family) 211980 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 211981 Ribosomal family S4e 211983 Protein kinase domain 212007 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 212030 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 212030 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 212034 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 212034 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 212043 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 212044 Ribosomal L29e protein family 212069 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 212084 Ribosomal protein S19 212085 Zinc finger, C3HC4 type (RING finger) 212111 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 212117 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 212117 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 212117 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 212117 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 212122 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 212129 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 212129 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 212130 Proteasome A-type and B-type 212149 ENV polyprotein (coat polyprotein) 212190 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 212198 WD domain, G-beta repeat 212205 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 212207 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 212211 ENV polyprotein (coat polyprotein) 212211 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 212211 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 212211 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 212218 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 212236 Ribosomal protein S5, N-terminal domain 212237 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 212252 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 212266 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 212266 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 212266 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 212267 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 212271 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 212271 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 212271 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 212276 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 212278 HMG (high mobility group) box 212281 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 212282 7 transmembrane receptor (rhodopsin family) 212282 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 212285 PH domain. PH stands for pleckstrin homology 212285 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 212285 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 212285 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 212289 7 transmembrane receptor (rhodopsin family) 212313 Pyruvate kinase, alpha/beta domain 212313 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 212313 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 212351 ENV polyprotein (coat polyprotein) 212366 MAS20 protein import receptor 212390 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 212399 Actin 212413 Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of t 212419 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 212419 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 212419 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 212419 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 212429 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 212429 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 212432 Calx-beta domain 212434 Cyclophilin type peptidyl-prolyl cis-trans isomerase 212437 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 212437 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 212446 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 212449 ENV polyprotein (coat polyprotein) 212460 Actin 212464 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 212476 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 212477 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 212503 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 212507 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 212508 GTPase of unknown function 212508 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 212508 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 212515 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 212528 Conserved hypothetical protein 95 212528 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 212528 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 212528 N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The e 212539 ADP-ribosylation factor family 212539 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 212541 7 transmembrane receptor (rhodopsin family) 212548 7 transmembrane receptor (rhodopsin family) 212549 7 transmembrane receptor (rhodopsin family) 212555 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How 212556 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 212558 7 transmembrane receptor (rhodopsin family) 212561 7 transmembrane receptor (rhodopsin family) 212562 7 transmembrane receptor (rhodopsin family) 212576 7 transmembrane receptor (rhodopsin family) 212607 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 212608 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 212618 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m 212632 Intermediate filament protein 212634 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 212647 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 212670 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 212677 Homeobox domain 212677 'Paired box' domain 212679 tRNA synthetases class I (K). This family includes only lysyl tRNA synthetases from prokaryotes 212679 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 212679 tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases 212684 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 212692 TMS membrane protein/tumour differentially expressed protein (TDE) 212699 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 212700 Domain of unknown function 212704 Ribosomal protein L35Ae 212710 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 212712 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 212718 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 212722 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 212725 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 212725 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 212727 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 212727 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 212741 Thymidine kinase 212744 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 212744 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 212746 Ribosomal protein L31e 212747 WD domain, G-beta repeat 212763 Core histone H2A/H2B/H3/H4 212782 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 212806 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 212806 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 212806 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 212808 Cyclophilin type peptidyl-prolyl cis-trans isomerase 212819 ENV polyprotein (coat polyprotein) 212819 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 212819 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 212819 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 212819 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 212819 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 212819 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 212844 Ribosomal protein L21e 212866 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 212866 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 212870 7 transmembrane receptor (rhodopsin family) 212870 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 212871 Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein 212884 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 212884 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 212915 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 212919 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 212921 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 212923 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 212932 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 212933 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 212941 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 212950 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 212950 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 212950 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 212950 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 212952 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 212958 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 212961 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 212970 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 212974 Glycosyl hydrolase family 65 central catalytic domain. This family of glycosyl hydrolases contains vacuolar acid trehalase and maltose phosphorylase.Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic pho 212989 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 212996 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 212997 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 212997 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 213001 7 transmembrane receptor (rhodopsin family) 213014 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 213030 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 213032 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 213037 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 213038 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 213038 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 213043 [2Fe-2S] binding domain 213043 CO dehydrogenase flavoprotein C-terminal domain 213043 FAD binding domain in molybdopterin dehydrogenase 213043 Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain 213043 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 213051 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 213053 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 213070 Chaperonin 10 Kd subunit 213070 Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module 213079 HMG (high mobility group) box 213093 Ribosomal protein L34e 213105 Ribosomal protein L19e 213109 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 213119 von Willebrand factor type A domain 213119 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 213120 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 213121 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 213121 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 213160 Ribosomal protein L21e 213177 Translation initiation factor SUI1 213179 Translation initiation factor SUI1 213212 Trefoil (P-type) domain 213212 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 213220 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 213251 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 213268 Ribosomal protein L19e 213272 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 213286 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 213287 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 213288 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 213288 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 213299 Ribonucleotide reductase, small chain 213311 F-box domain 213320 HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins 213326 Protein kinase domain 213352 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 213363 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 213363 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 213367 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 213391 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 213404 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 213409 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 213411 Protein-tyrosine phosphatase 213411 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 213416 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 213420 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 213422 Ribosomal L28e protein family 213423 7 transmembrane receptor (rhodopsin family) 213426 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 213428 Fibronectin type III domain 213435 Protein kinase domain 213436 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 213436 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 213437 7 transmembrane receptor (metabotropic glutamate family) 213438 7 transmembrane receptor (rhodopsin family) 213438 7 transmembrane receptor (rhodopsin family) 213439 7 transmembrane receptor (rhodopsin family) 213440 HMG (high mobility group) box 213451 Tropomyosin 213454 Zinc finger, C3HC4 type (RING finger) 213469 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 213469 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 213474 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 213478 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 213480 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 213481 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 213484 NUDIX domain 213498 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 213499 F-box domain 213499 F-box domain 213499 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 213514 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 213514 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 213522 PH domain. PH stands for pleckstrin homology 213522 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 213527 7 transmembrane receptor (Secretin family) 213527 7 transmembrane receptor (Secretin family) 213533 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 213533 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 213539 BAG domain. Domain present in Hsp70 regulators 213541 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 213556 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 213556 PH domain. PH stands for pleckstrin homology 213556 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 213556 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 213560 Ribosomal protein L6 213570 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 213584 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 213586 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 213586 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 213587 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 213595 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 213611 Protein-tyrosine phosphatase 213638 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 213684 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 213693 Actin 213694 ENV polyprotein (coat polyprotein) 213701 Ribosomal protein L10 213702 Protein kinase domain 213758 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 213788 7 transmembrane receptor (rhodopsin family) 213826 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 213826 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 213827 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 213856 Core histone H2A/H2B/H3/H4 213878 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 213879 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 213881 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 213881 CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily con 213881 CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserv 213898 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 213923 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 213943 von Willebrand factor type A domain 213943 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 213943 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 213945 von Willebrand factor type A domain 213946 WD domain, G-beta repeat 213973 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 213973 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 213977 HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins 213985 Ribosomal protein S6e 213990 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 213994 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 214029 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 214036 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 214038 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron 214062 Ribosomal protein L6 214066 Zn-finger in Ran binding protein and others 214072 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 214111 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 214137 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 214137 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 214141 Ribosomal protein L11, RNA binding domain 214141 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 214144 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 214149 jmjN domain 214149 jmjC domain 214150 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 214150 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 214151 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 214158 B-box zinc finger 214158 Zinc finger, C3HC4 type (RING finger) 214158 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 214158 ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a 214162 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 214162 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 214162 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 214162 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 214166 Ribosomal protein L23 214198 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 214230 Protein kinase domain 214230 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 214245 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 214247 ENV polyprotein (coat polyprotein) 214253 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 214253 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 214253 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 214270 Ribosomal protein L3 214278 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 214286 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 214290 PAP/25A associated domain 214292 ENV polyprotein (coat polyprotein) 214292 ENV polyprotein (coat polyprotein) 214299 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 214301 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 214320 Sulfotransferase protein 214321 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 214321 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 214362 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 214377 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 214377 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 214377 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 214384 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 214384 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 214390 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 214401 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 214403 Sushi domain (SCR repeat) 214410 Clathrin light chain 214421 Sushi domain (SCR repeat) 214434 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 214447 WD domain, G-beta repeat 214450 LIM domain. This family represents two copies of the LIM structural domain 214456 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 214461 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 214461 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 214521 Ribosomal protein L21e 214522 ENV polyprotein (coat polyprotein) 214531 Trypsin 214531 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 214534 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 214534 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 214545 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 214545 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 214575 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 214590 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 214590 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 214593 Animal haem peroxidase 214593 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 214593 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous 214603 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E 214603 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 214611 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 214614 Fatty acid desaturase 214616 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 214620 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 214627 PAP/25A associated domain 214627 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 214652 Zinc finger, C3HC4 type (RING finger) 214658 ENV polyprotein (coat polyprotein) 214663 Mitochondrial carrier protein 214680 ENV polyprotein (coat polyprotein) 214680 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 214682 Myosin head (motor domain) 214686 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 214695 Ribosomal protein S6e 214714 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 214728 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 214732 Ribosomal protein S2 214738 Ribosomal protein S6e 214739 Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold 214739 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 214742 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 214748 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 214748 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 214759 Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation 214761 Proteasome A-type and B-type 214766 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 214767 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 214767 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 214783 Fibronectin type III domain 214783 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 214795 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 214804 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 214849 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 214880 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 214891 ENV polyprotein (coat polyprotein) 214891 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 214895 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-2 214897 Protein kinase domain 214899 jmjC domain 214899 jmjN domain 214899 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 214899 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 214899 jmjC domain 214899 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 214899 pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function 214905 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 214913 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 214922 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 214923 Polyprenyl synthetase 214944 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 214944 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 214951 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 214952 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 214967 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 214968 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 214973 Cyclophilin type peptidyl-prolyl cis-trans isomerase 214976 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 214976 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 215001 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 215028 Somatotropin hormone family 215029 Somatotropin hormone family 215034 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 215052 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 215061 B-box zinc finger 215061 Zinc finger, C3HC4 type (RING finger) 215061 B-box zinc finger 215061 Zinc finger, C3HC4 type (RING finger) 215061 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 215065 ENV polyprotein (coat polyprotein) 215065 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 215065 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 215078 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti 215085 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 215086 Transaldolase 215093 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 215095 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil 215098 Ribosomal protein S17 215157 Sushi domain (SCR repeat) 215159 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 215160 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 215166 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 215166 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 215192 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 215196 Ribosomal protein S11 215204 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 215208 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 215231 Cyclophilin type peptidyl-prolyl cis-trans isomerase 215232 Cyclophilin type peptidyl-prolyl cis-trans isomerase 215253 Zinc-binding dehydrogenase 215253 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 215282 Galactose binding lectin domain 215288 Proteasome A-type and B-type 215332 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 215335 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 215337 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 215338 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 215338 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 215339 Ribosomal protein S5, N-terminal domain 215339 Ribosomal protein S5, C-terminal domain 215351 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 215378 Giardia variant-specific surface protein 215393 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 215394 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 215405 GTPase of unknown function 215405 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 215405 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 215408 Thrombospondin type 1 domain 215408 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 215411 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 215419 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 215422 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 215422 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 215424 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 215428 CUB domain 215428 Sushi domain (SCR repeat) 215435 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 215446 GDA1/CD39 (nucleoside phosphatase) family 215446 Protein kinase domain 215446 GDA1/CD39 (nucleoside phosphatase) family 215449 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 215452 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 215493 Glycosyltransferase family 6 215495 Ribosomal protein S5, C-terminal domain 215495 Ribosomal protein S5, N-terminal domain 215539 Vinculin family 215539 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 215539 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 215544 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 215562 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 215562 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 215605 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 215615 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 215621 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 215627 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 215632 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 215632 PH domain. PH stands for pleckstrin homology 215632 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 215640 Ribosomal protein L34e 215641 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 215641 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 215653 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 215653 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 215654 Cadherin domain 215654 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 215667 Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family 215678 Ribosomal L29e protein family 215695 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 215705 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp 215705 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps 215734 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 215777 ENV polyprotein (coat polyprotein) 215791 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 215791 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 215797 7 transmembrane receptor (Secretin family) 215798 CUB domain 215798 Pentaxin family. Pentaxins are also known as pentraxins 215802 HMG (high mobility group) box 215806 Cornichon protein 215823 ENV polyprotein (coat polyprotein) 215823 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 215844 FKBP-type peptidyl-prolyl cis-trans isomerase 215845 Ribosomal L29e protein family 215845 Cyclophilin type peptidyl-prolyl cis-trans isomerase 215852 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 215852 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 215853 Aconitase family (aconitate hydratase) 215854 7 transmembrane receptor (rhodopsin family) 215855 7 transmembrane receptor (rhodopsin family) 215856 7 transmembrane receptor (rhodopsin family) 215857 7 transmembrane receptor (rhodopsin family) 215859 7 transmembrane receptor (rhodopsin family) 215861 7 transmembrane receptor (rhodopsin family) 215865 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 215866 ENV polyprotein (coat polyprotein) 215867 ENV polyprotein (coat polyprotein) 215869 ENV polyprotein (coat polyprotein) 215871 Phosphatidylethanolamine-binding protein 215872 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 215873 ENV polyprotein (coat polyprotein) 215877 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 215890 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 215890 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 215890 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 215890 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 215892 ENV polyprotein (coat polyprotein) 215892 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 215893 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 215893 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 215894 ENV polyprotein (coat polyprotein) 215894 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 215894 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 215894 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 215894 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 215894 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 215895 Sulfotransferase protein 215896 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 215896 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 215905 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 215906 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 215913 Ribosomal protein S26e 215919 Zinc finger, C3HC4 type (RING finger) 215920 Ribosomal S17 215922 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 215941 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 215944 7 transmembrane receptor (rhodopsin family) 215946 Adenylate kinase 215948 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 215950 Uracil DNA glycosylase superfamily 215962 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 215963 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 215969 Triosephosphate isomerase 215974 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 215974 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 215977 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 215979 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 215979 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 215979 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 215988 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 215988 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 215988 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 216019 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 216020 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 216020 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 216024 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 216033 Vinculin family 216033 Vinculin family 216036 Ribosomal protein S6e 216057 Enolase, C-terminal TIM barrel domain 216081 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 216081 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 216082 Ets-domain 216099 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 216115 Sodium:solute symporter family 216134 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess 216135 7 transmembrane receptor (metabotropic glutamate family) 216136 Thiamine pyrophosphate enzyme, C-terminal TPP binding domain 216136 Thiamine pyrophosphate enzyme, N-terminal TPP binding domain 216136 Thiamine pyrophosphate enzyme, central domain. The central domain of TPP enzymes contains a 2-fold Rossman fold 216136 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 216136 Thiamine pyrophosphate enzyme, C-terminal TPP binding domain 216136 Thiamine pyrophosphate enzyme, N-terminal TPP binding domain 216136 Thiamine pyrophosphate enzyme, central domain. The central domain of TPP enzymes contains a 2-fold Rossman fold 216136 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 216142 7 transmembrane receptor (rhodopsin family) 216143 7 transmembrane receptor (rhodopsin family) 216148 SH2 domain 216148 Phosphotyrosine interaction domain (PTB/PID) 216150 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 216152 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 216152 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 216154 WD domain, G-beta repeat 216156 WD domain, G-beta repeat 216166 Protein kinase domain 216166 POLO box duplicated region 216177 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 216178 Protein kinase domain 216179 Clathrin adaptor complex small chain 216179 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 216185 Transcription initiation factor TFIID 23-30kDa subunit 216188 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 216190 PH domain. PH stands for pleckstrin homology 216210 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 216223 Fibronectin type III domain 216224 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 216229 WD domain, G-beta repeat 216238 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 216244 Transaldolase 216282 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 216285 Homeobox domain 216294 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 216299 von Willebrand factor type D domain 216301 Ribosomal protein S6e 216317 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 216330 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 216334 Ribosomal protein L21e 216343 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 216344 ADP-ribosylation factor family 216344 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 216347 Ribosomal S3Ae family 216350 Tetraspanin family 216363 Uncharacterized ACR, COG1579 216363 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 216363 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 216390 Ribosomal protein L21e 216394 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 216394 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 216416 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 216439 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 216439 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 216439 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 216439 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 216441 Sulfate transporter family. Mutations may lead to several human diseases 216443 Vinculin family 216443 WHEP-TRS domain 216443 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 216443 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 216453 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 216454 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 216456 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 216456 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 216460 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 216467 7 transmembrane receptor (rhodopsin family) 216472 7 transmembrane receptor (rhodopsin family) 216483 7 transmembrane receptor (rhodopsin family) 216502 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 216502 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 216505 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 216524 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 216530 Adenylate and Guanylate cyclase catalytic domain 216542 ENV polyprotein (coat polyprotein) 216544 Ribosomal protein L11, RNA binding domain 216544 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 216546 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 216546 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 216550 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 216550 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 216551 Galactoside-binding lectin 216558 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uri 216600 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 216600 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 216600 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 216600 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 216605 ENV polyprotein (coat polyprotein) 216605 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 216605 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 216605 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 216605 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 216610 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 216622 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 216622 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 216635 Globin 216635 Globin 216635 Globin 216636 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 216636 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 216639 Ribosomal protein S2 216643 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 216643 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 216672 Ribosomal L22e protein family 216674 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 216678 HMG (high mobility group) box 216681 NAC domain 216691 Ribosomal protein S5, N-terminal domain 216696 ENV polyprotein (coat polyprotein) 216704 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 216705 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 216707 Ribosomal L39 protein 216713 7 transmembrane receptor (rhodopsin family) 216715 7 transmembrane receptor (rhodopsin family) 216716 7 transmembrane receptor (rhodopsin family) 216717 7 transmembrane receptor (rhodopsin family) 216720 7 transmembrane receptor (rhodopsin family) 216721 7 transmembrane receptor (rhodopsin family) 216721 1,3-beta-glucan synthase component. This family consists of various 1,3-beta-glucan synthase components including Gls1, Gls2 and Gls3 from yeast. 1,3-beta-glucan synthase EC:2.4.1.34 also known as callose synthase catalyses the formation of a beta-1,3-glu 216724 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 216724 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 216725 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 216725 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 216725 Thrombospondin type 1 domain 216725 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 216726 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 216732 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone 216749 7 transmembrane receptor (rhodopsin family) 216763 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M 216767 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 216772 Phospholipase A2 inhibitor 216774 7 transmembrane receptor (rhodopsin family) 216781 B-box zinc finger 216781 Zinc finger, C3HC4 type (RING finger) 216781 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 216782 7 transmembrane receptor (rhodopsin family) 216783 7 transmembrane receptor (rhodopsin family) 216784 7 transmembrane receptor (rhodopsin family) 216785 7 transmembrane receptor (rhodopsin family) 216787 7 transmembrane receptor (rhodopsin family) 216788 7 transmembrane receptor (rhodopsin family) 216790 Protein kinase domain 216792 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 216795 wnt family 216797 Trypsin 216798 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 216799 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 216800 HMG (high mobility group) box 216800 7 transmembrane receptor (rhodopsin family) 216802 7 transmembrane receptor (rhodopsin family) 216803 7 transmembrane receptor (rhodopsin family) 216810 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 216814 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 216817 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 216818 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 216818 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 216825 Ubiquitin carboxyl-terminal hydrolase family 2 216825 Ubiquitin carboxyl-terminal hydrolases family 2 216825 Zn-finger in ubiquitin-hydrolases and other protein 216827 Ribosomal protein S15 216831 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 216834 HIT family 216835 Ubiquitin carboxyl-terminal hydrolase family 2 216835 Ubiquitin carboxyl-terminal hydrolases family 2 216850 jmjC domain 216851 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 216851 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 216856 Carboxylesterase 216858 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 216859 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 216864 Hepatic lectin, N-terminal domain 216869 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 216869 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 216874 CG-1 domain. CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein. CG-1 domains are ass 216874 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 216874 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 216877 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 216877 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 216881 Sulfotransferase protein 216881 Sulfotransferase protein 216881 WSC domain. This domain may be involved in carbohydrate binding 216884 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera 216886 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 216900 7 transmembrane receptor (rhodopsin family) 216904 7 transmembrane receptor (rhodopsin family) 216909 7 transmembrane receptor (rhodopsin family) 216927 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 216928 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 216940 7 transmembrane receptor (rhodopsin family) 216953 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 216959 Helix-loop-helix DNA-binding domain 216961 WD domain, G-beta repeat 216963 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 216963 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 216963 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 216976 Protein kinase domain 216991 PH domain. PH stands for pleckstrin homology 216991 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 217031 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 217057 Domain of unknown function UPF0099. This domain has no known function 217066 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 217069 Zinc finger, C3HC4 type (RING finger) 217069 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 217069 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 217092 ENV polyprotein (coat polyprotein) 217114 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 217119 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 217123 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 217123 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 217127 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 217127 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 217127 MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases 217134 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 217151 ADP-ribosylation factor family 217151 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 217151 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 217154 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 217154 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 217164 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 217166 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 217166 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 217167 Connexin 217175 Intermediate filament protein 217175 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 217177 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 217177 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217194 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 217194 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 217198 PH domain. PH stands for pleckstrin homology 217212 Pancreatic hormone peptide 217229 Ribosomal protein S5, C-terminal domain 217229 Ribosomal protein S5, N-terminal domain 217245 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 217246 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 217255 Ribosomal protein S7e 217280 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 217302 7 transmembrane receptor (rhodopsin family) 217302 7 transmembrane receptor (rhodopsin family) 217322 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 217328 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 217333 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 217342 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 217342 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 217344 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 217351 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 217351 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 217364 Glycosyl hydrolase family 85. Family of endo-beta-N-acetylglucosaminidases. These enzymes work on a broad spectrum of substrates 217369 7 transmembrane receptor (rhodopsin family) 217376 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 217377 Cyclophilin type peptidyl-prolyl cis-trans isomerase 217378 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con 217379 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 217383 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 217384 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 217390 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 217397 Formate--tetrahydrofolate ligase 217397 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 217404 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 217409 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 217409 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217451 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 217451 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217463 PX domain. PX domains bind to phosphoinositides 217463 PXA domain. This domain is associated with PX domains pfam00787 217463 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 217480 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 217480 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 217480 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 217480 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 217493 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 217495 ENV polyprotein (coat polyprotein) 217503 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 217536 Ribosomal RNA adenine dimethylase 217540 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 217540 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217547 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 217547 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217558 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 217566 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 217573 Ribosomal protein L21e 217576 Ribosomal protein L31e 217578 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 217584 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 217584 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217586 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 217593 Mitochondrial carrier protein 217614 Ribosomal protein S5, C-terminal domain 217614 Ribosomal protein S5, N-terminal domain 217616 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 217616 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217628 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 217628 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217630 Mak10 subunit, NatC N(alpha)-terminal acetyltransferase. NatC N(alpha)-terminal acetyltransferases contains Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex 217632 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 217647 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 217670 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 217674 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 217682 PH domain. PH stands for pleckstrin homology 217684 Sushi domain (SCR repeat) 217690 Protein kinase domain 217690 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 217692 Rap/ran-GAP 217693 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 217695 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 217696 Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene 217702 ATP synthase delta (OSCP) subunit. The ATP D subunit from E. coli is the same as the OSCP subunit which is this family. The ATP D subunit from metazoa are found in family pfam00401 217715 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac 217733 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 217738 Thrombospondin type 1 domain 217766 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 217767 HMG (high mobility group) box 217777 Phosphoglycerate kinase 217779 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 217779 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 217786 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 217786 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 217790 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 217810 Ribosomal protein S5, C-terminal domain 217826 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 217835 Vacuolar sorting protein 9 (VPS9) domain 217835 Vacuolar sorting protein 9 (VPS9) domain 217847 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 217849 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 217852 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 217854 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 217864 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 217864 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL 217866 Tropomyosin 217866 Protein kinase domain 217866 Intermediate filament protein 217866 Uncharacterized ACR, COG1579 217866 Protein kinase C terminal domain 217866 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 217866 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 217866 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length 217866 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 217866 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E 217866 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 217866 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 217866 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 217869 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 217869 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 217869 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 217886 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 217908 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 217916 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 217918 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 217921 HMG (high mobility group) box 217922 HMG (high mobility group) box 217926 HMG (high mobility group) box 217928 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 217929 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 217929 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 217930 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 217931 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 217932 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 217944 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 217957 Transcription initiation factor IIA, gamma subunit, beta-barrel domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multi 217957 Transcription initiation factor IIA, gamma subunit, helical domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multimeri 217958 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 217963 Ribosomal L29e protein family 217991 Ribosomal protein L21e 217999 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 218005 Elongation factor 1 gamma, conserved domain 218006 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 218024 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 218025 ADP-ribosylation factor family 218030 Homeobox domain 218038 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 218038 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 218039 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 218051 7 transmembrane receptor (rhodopsin family) 218054 7 transmembrane receptor (rhodopsin family) 218060 Ribosomal L29e protein family 218065 7 transmembrane receptor (rhodopsin family) 218066 7 transmembrane receptor (rhodopsin family) 218068 7 transmembrane receptor (rhodopsin family) 218075 Protein kinase domain 218110 Somatotropin hormone family 218146 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 218171 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 218184 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 218184 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 218184 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 218184 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 218203 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 218203 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 218210 Zn-finger in Ran binding protein and others 218210 Zn-finger in Ran binding protein and others 218214 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 218241 Protein kinase domain 218242 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 218243 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218243 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218244 Protein kinase domain 218268 Eukaryotic initiation factor 4E 218269 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and pfam00349. Some members of the family have two copies of each of these domains 218269 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 218271 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 218272 Protein kinase domain 218274 Ribosomal protein L21e 218275 Papain family cysteine protease 218275 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 218276 Malic enzyme, NAD binding domain 218282 Ribosomal protein L35Ae 218297 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 218298 SKIP/SNW domain. This domain is found in chromatin proteins 218299 Cyclophilin type peptidyl-prolyl cis-trans isomerase 218304 Trypsin 218307 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 218307 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 218310 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 218313 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 218314 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 218317 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 218317 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 218322 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218322 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218324 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 218326 Ribosomal protein L10 218332 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218332 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218340 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 218341 Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines 218354 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 218357 ENV polyprotein (coat polyprotein) 218357 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 218357 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 218358 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 218358 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 218359 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 218360 Low molecular weight phosphotyrosine protein phosphatase 218362 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 218363 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 218363 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 218369 Actin 218372 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 218385 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 218385 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 218397 SH2 domain 218397 PH domain. PH stands for pleckstrin homology 218397 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 218402 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218402 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218411 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 218411 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 218411 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 218411 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 218412 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218412 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218419 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 218420 Hsp90 protein 218426 Ribosomal protein L23 218428 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 218428 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 218428 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 218432 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 218432 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 218440 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 218440 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 218441 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 218442 TMS membrane protein/tumour differentially expressed protein (TDE) 218447 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218447 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218456 Nucleoside diphosphate kinase 218461 3'5'-cyclic nucleotide phosphodiesterase 218473 EAP30. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of tr 218476 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 218476 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 218499 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 218501 Calponin family repeat 218501 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 218518 Occludin/ELL family 218518 Occludin/ELL family 218523 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218523 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218524 HMG (high mobility group) box 218538 Ets-domain 218569 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 218580 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 218581 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 218606 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 218606 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 218613 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 218614 Protein kinase domain 218614 Zinc finger, C3HC4 type (RING finger) 218617 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 218620 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 218624 Fibronectin type III domain 218630 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 218630 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 218631 Trypsin 218637 Ribosomal L39 protein 218639 ADP-ribosylation factor family 218658 Ribosomal protein L3 218663 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 218693 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 218699 PX domain. PX domains bind to phosphoinositides 218704 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 218728 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218728 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218741 Ribosomal protein L19e 218744 Cadherin domain 218750 HMG (high mobility group) box 218751 7 transmembrane receptor (rhodopsin family) 218757 Protein kinase domain 218759 Protein kinase domain 218761 ENV polyprotein (coat polyprotein) 218769 Ribosomal protein L31e 218769 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 218772 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 218772 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 218779 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 218787 ENV polyprotein (coat polyprotein) 218788 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218788 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 218792 Squalene/phytoene synthase 218799 ENV polyprotein (coat polyprotein) 218811 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 218814 Core histone H2A/H2B/H3/H4 218827 Ribosomal protein L31e 218832 RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family 218832 RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family includ 218840 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 218865 GMC oxidoreductase. This family of proteins bind FAD as a cofactor 218865 GMC oxidoreductase. This family of proteins bind FAD as a cofactor 218865 GMC oxidoreductase. This family of proteins bind FAD as a cofactor 218873 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 218877 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 218878 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is 218878 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 218933 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 218940 Ribosomal protein L6e 218945 CAP protein 218948 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 218955 Ribosomal protein L6 218963 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 218977 Guanylate-kinase-associated protein (GKAP) protein 218979 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 218986 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 218997 Ribosomal protein L19e 219002 7 transmembrane receptor (rhodopsin family) 219007 7 transmembrane receptor (rhodopsin family) 219009 7 transmembrane receptor (rhodopsin family) 219010 7 transmembrane receptor (rhodopsin family) 219011 7 transmembrane receptor (rhodopsin family) 219014 7 transmembrane receptor (rhodopsin family) 219016 7 transmembrane receptor (rhodopsin family) 219019 7 transmembrane receptor (rhodopsin family) 219021 7 transmembrane receptor (rhodopsin family) 219033 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 219033 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 219036 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 219037 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 219042 7 transmembrane receptor (rhodopsin family) 219048 7 transmembrane receptor (rhodopsin family) 219049 Proteasome A-type and B-type 219050 7 transmembrane receptor (rhodopsin family) 219054 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 219056 Enolase, C-terminal TIM barrel domain 219057 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 219058 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 219060 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 219061 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 219081 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 219102 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 219102 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 219103 Uncharacterized ACR, COG1579 219103 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 219106 Putative snoRNA binding domain. This family consists of various Pre RNA processing ribonucleoproteins. The function of the aligned region is unknown however it may be a common RNA or snoRNA or Nop1p binding domain. Nop5p (Nop58p) from yeast is the protein 219107 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 219110 Ribosomal S17 219126 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 219126 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 219133 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 219140 PH domain. PH stands for pleckstrin homology 219140 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 219140 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 219144 ADP-ribosylation factor family 219144 ADP-ribosylation factor family 219144 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 219145 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 219146 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 219147 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 219150 Homeobox domain 219167 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 219180 Ribosomal protein S5, C-terminal domain 219183 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 219183 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 219183 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 219186 HMG (high mobility group) box 219191 Ribosomal protein S6e 219230 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 219236 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 219252 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 219252 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 219254 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 219254 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 219257 Cadherin domain 219257 Cadherin domain 219293 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 219332 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 219333 Ubiquitin carboxyl-terminal hydrolase family 2 219333 Ubiquitin carboxyl-terminal hydrolases family 2 219346 Homeobox domain 219402 Translation initiation factor IF-3 219405 Ribosomal protein L21e 219409 Homeobox domain 219409 Homeobox domain 219414 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 219417 7 transmembrane receptor (rhodopsin family) 219421 7 transmembrane receptor (rhodopsin family) 219424 7 transmembrane receptor (rhodopsin family) 219425 7 transmembrane receptor (rhodopsin family) 219427 7 transmembrane receptor (rhodopsin family) 219428 7 transmembrane receptor (rhodopsin family) 219429 7 transmembrane receptor (rhodopsin family) 219430 7 transmembrane receptor (rhodopsin family) 219431 7 transmembrane receptor (rhodopsin family) 219432 7 transmembrane receptor (rhodopsin family) 219436 7 transmembrane receptor (rhodopsin family) 219437 7 transmembrane receptor (rhodopsin family) 219438 7 transmembrane receptor (rhodopsin family) 219439 7 transmembrane receptor (rhodopsin family) 219440 7 transmembrane receptor (rhodopsin family) 219442 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 219443 7 transmembrane receptor (rhodopsin family) 219444 7 transmembrane receptor (rhodopsin family) 219445 7 transmembrane receptor (rhodopsin family) 219447 7 transmembrane receptor (rhodopsin family) 219449 7 transmembrane receptor (rhodopsin family) 219450 7 transmembrane receptor (rhodopsin family) 219451 7 transmembrane receptor (rhodopsin family) 219453 7 transmembrane receptor (rhodopsin family) 219455 7 transmembrane receptor (rhodopsin family) 219459 7 transmembrane receptor (rhodopsin family) 219463 7 transmembrane receptor (rhodopsin family) 219464 7 transmembrane receptor (rhodopsin family) 219465 7 transmembrane receptor (rhodopsin family) 219467 7 transmembrane receptor (rhodopsin family) 219469 7 transmembrane receptor (rhodopsin family) 219473 7 transmembrane receptor (rhodopsin family) 219475 7 transmembrane receptor (rhodopsin family) 219477 7 transmembrane receptor (rhodopsin family) 219478 7 transmembrane receptor (rhodopsin family) 219479 7 transmembrane receptor (rhodopsin family) 219480 7 transmembrane receptor (rhodopsin family) 219481 7 transmembrane receptor (rhodopsin family) 219482 7 transmembrane receptor (rhodopsin family) 219484 7 transmembrane receptor (rhodopsin family) 219485 7 transmembrane receptor (rhodopsin family) 219487 7 transmembrane receptor (rhodopsin family) 219490 7 transmembrane receptor (rhodopsin family) 219491 7 transmembrane receptor (rhodopsin family) 219493 7 transmembrane receptor (rhodopsin family) 219494 7 transmembrane receptor (rhodopsin family) 219522 Fatty acid desaturase 219522 7 transmembrane receptor (rhodopsin family) 219523 7 transmembrane receptor (rhodopsin family) 219527 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 219527 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 219537 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 219539 Yippee putative zinc-binding protein 219542 Ribosomal protein S26e 219558 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 219558 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 219595 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 219635 Ribosomal protein L44 219662 Ribosomal protein L21e 219712 Hsp90 protein 219756 Protein kinase domain 219770 Connexin 219771 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 219771 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi 219773 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 219809 Ribosomal L10 219843 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 219858 7 transmembrane receptor (rhodopsin family) 219865 7 transmembrane receptor (rhodopsin family) 219866 7 transmembrane receptor (rhodopsin family) 219868 7 transmembrane receptor (rhodopsin family) 219869 7 transmembrane receptor (rhodopsin family) 219870 7 transmembrane receptor (rhodopsin family) 219871 7 transmembrane receptor (rhodopsin family) 219872 7 transmembrane receptor (rhodopsin family) 219873 7 transmembrane receptor (rhodopsin family) 219874 7 transmembrane receptor (rhodopsin family) 219875 7 transmembrane receptor (rhodopsin family) 219891 Programmed cell death protein 2, C-terminal domain 219918 ENV polyprotein (coat polyprotein) 219927 Ribosomal prokaryotic L21 protein 219927 Ribosomal prokaryotic L21 protein 219928 7 transmembrane receptor (rhodopsin family) 219931 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 219952 7 transmembrane receptor (rhodopsin family) 219953 7 transmembrane receptor (rhodopsin family) 219953 7 transmembrane receptor (metabotropic glutamate family) 219954 7 transmembrane receptor (rhodopsin family) 219955 7 transmembrane receptor (rhodopsin family) 219956 7 transmembrane receptor (rhodopsin family) 219957 7 transmembrane receptor (rhodopsin family) 219958 7 transmembrane receptor (rhodopsin family) 219959 7 transmembrane receptor (rhodopsin family) 219960 7 transmembrane receptor (rhodopsin family) 219961 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 219962 7 transmembrane receptor (rhodopsin family) 219964 7 transmembrane receptor (rhodopsin family) 219965 7 transmembrane receptor (rhodopsin family) 219966 7 transmembrane receptor (rhodopsin family) 219967 7 transmembrane receptor (rhodopsin family) 219968 7 transmembrane receptor (rhodopsin family) 219972 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl 219979 7 transmembrane receptor (rhodopsin family) 219980 7 transmembrane receptor (rhodopsin family) 219981 7 transmembrane receptor (rhodopsin family) 219982 7 transmembrane receptor (rhodopsin family) 219983 7 transmembrane receptor (rhodopsin family) 219985 7 transmembrane receptor (rhodopsin family) 219986 7 transmembrane receptor (rhodopsin family) 219987 7 transmembrane receptor (rhodopsin family) 219995 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 219996 Zona pellucida-like domain 220001 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 220002 Cytochrome b561 220027 Adenylate and Guanylate cyclase catalytic domain 220056 Proteasome A-type and B-type 220061 Actinobacillus constitutively-expressed outer membrane lipoprotein A 220071 ENV polyprotein (coat polyprotein) 220077 PTB domain (IRS-1 type) 220079 NAD:arginine ADP-ribosyltransferase 220115 Protein-tyrosine phosphatase 220115 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 220147 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 220147 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 220148 Pyridoxal-dependent decarboxylase conserved domain 220148 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined b 220359 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d 220359 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 220429 Uncharacterized ACR, COG1579 220433 Ribosomal family S4e 220466 PH domain. PH stands for pleckstrin homology 220466 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 220486 Zinc finger, C3HC4 type (RING finger) 220486 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 220522 Gamma-glutamyltranspeptidase 220594 Ubiquitin carboxyl-terminal hydrolase family 2 220635 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 220667 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 220670 Ribosomal protein S2 220672 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 220686 Phosphatidylinositol 3- and 4-kinase 220717 Ribosomal protein L10 220729 FAD binding domain. This family includes members that bind FAD. This family includes the flavoprotein subunits from succinate and fumarate dehydrogenase, aspartate oxidase and the alpha subunit of adenylylsulfate reductase 220758 Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a 220763 Hsp90 protein 220830 PX domain. PX domains bind to phosphoinositides 220830 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 220832 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 220840 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 220848 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 220869 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 220895 Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity 220906 L1 transposable element 220959 Intermediate filament protein 220959 IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation 220959 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 220973 7 transmembrane receptor (rhodopsin family) 220984 Ribosomal protein L6 220988 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 220999 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 221009 Poly-adenylate binding protein, unique domain 221017 Intermediate filament protein 221035 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 221037 jmjC domain 221037 jmjC domain 221057 Ribosomal protein S2 221074 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 221078 NOL1/NOP2/sun family 221079 ADP-ribosylation factor family 221079 ADP-ribosylation factor family 221079 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 221079 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 221091 Fibronectin type III domain 221092 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 221092 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 221180 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 221184 C2 domain 221188 7 transmembrane receptor (Secretin family) 221188 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 221191 Trypsin 221223 Carboxylesterase 221259 ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other 221264 Adenylate kinase 221267 Calponin family repeat 221267 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 221270 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 221278 Zinc finger, C3HC4 type (RING finger) 221278 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 221286 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 221300 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 221311 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 221324 Ribosomal protein L13 221324 7 transmembrane receptor (rhodopsin family) 221341 Regulator of chromosome condensation (RCC1) 221349 Ribosomal L10 221349 Nucleotide-sensitive chloride conductance regulator (ICln) 221356 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 221357 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation 221368 Ribosomal protein S8 221374 Ribosomal S17 221382 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 221382 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 221390 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 221391 7 transmembrane receptor (rhodopsin family) 221392 Ribosomal protein L15 221393 7 transmembrane receptor (Secretin family) 221395 7 transmembrane receptor (Secretin family) 221400 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 221424 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 221456 Kinesin motor domain 221460 Ribosomal protein S5, N-terminal domain 221496 LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1 221504 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 221509 Class II histocompatibility antigen, beta domain 221527 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 221527 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 221544 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 221547 ADP-ribosylation factor family 221547 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 221549 Class I Histocompatibility antigen, domains alpha 1 and 2 221550 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 221557 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 221566 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 221566 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 221567 7 transmembrane receptor (rhodopsin family) 221568 7 transmembrane receptor (rhodopsin family) 221570 7 transmembrane receptor (rhodopsin family) 221571 7 transmembrane receptor (rhodopsin family) 221572 ADP-ribosylation factor family 221572 7 transmembrane receptor (rhodopsin family) 221586 7 transmembrane receptor (rhodopsin family) 221608 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 221622 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 221644 Ribosomal L29e protein family 221654 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 221654 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 221692 RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impli 221697 Ribosomal L15 221703 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 221721 Ribosomal protein L21e 221740 NAC domain 221823 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosp 221831 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 221831 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 221831 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 221839 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 221839 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 221875 Tropomyosin 221889 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 221890 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 221914 Glypican 221935 Fibronectin type III domain 221937 Fork head domain 221937 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 221948 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 221955 Lipase (class 3) 221979 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 222005 Mitochondrial carrier protein 222009 Ribosomal protein L31e 222017 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 222017 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 222019 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 222032 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 222052 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 222068 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 222068 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 222103 Intermediate filament protein 222117 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 222258 WD domain, G-beta repeat 222346 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 222487 7 transmembrane receptor (Secretin family) 222487 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 222537 Sulfotransferase protein 222545 7 transmembrane receptor (metabotropic glutamate family) 222545 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 222546 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 222611 7 transmembrane receptor (Secretin family) 222642 TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermediat 222643 Death domain 222663 CUB domain 222696 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 222696 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 222699 BTG1 family. A novel family of anti-proliferative proteins 222865 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 222894 Helix-loop-helix DNA-binding domain 222901 Ribosomal protein L14p/L23e 222962 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 223117 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 223126 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 223127 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 223127 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 223134 Ribosomal S3Ae family 223134 FMN-dependent dehydrogenase 223135 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 223149 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223149 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223155 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 223156 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223156 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223176 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 223176 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 223176 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 223177 Actin 223181 Actin 223196 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 223199 ENV polyprotein (coat polyprotein) 223200 Hsp90 protein 223202 Ribosomal S17 223227 HMG (high mobility group) box 223227 HMG (high mobility group) box 223227 HMG (high mobility group) box 223254 PH domain. PH stands for pleckstrin homology 223254 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 223254 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 223254 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 223261 Ribosomal protein S17 223262 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe 223267 Uncharacterised protein family (UPF0131). This family of proteins are uncharacterised, however BtrG is part of a butirosin-biosynthetic gene cluster from Bacillus circulans 223278 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223278 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223283 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223283 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223298 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 223319 WD domain, G-beta repeat 223328 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 223335 UDP-glucoronosyl and UDP-glucosyl transferase 223337 UDP-glucoronosyl and UDP-glucosyl transferase 223337 UDP-glucoronosyl and UDP-glucosyl transferase 223346 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 223364 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 223365 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223365 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223375 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 223385 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 223397 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 223398 RNA polymerase Rpb4 223398 Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function 223399 ENV polyprotein (coat polyprotein) 223404 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 223409 WD domain, G-beta repeat 223417 Cyclophilin type peptidyl-prolyl cis-trans isomerase 223425 Ribosomal protein S5, C-terminal domain 223425 Ribosomal protein S5, N-terminal domain 223435 Protein kinase domain 223435 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 223470 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 223470 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 223481 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 223481 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 223488 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 223496 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 223499 WD domain, G-beta repeat 223499 Sof1-like domain. Sof1 is essential for cell growth and is a component of the nucleolar rRNA processing machinery 223500 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223500 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223502 Ribosomal protein L6e 223510 ENV polyprotein (coat polyprotein) 223510 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 223522 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 223522 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 223524 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 223526 S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure 223536 Ribosomal protein L6e 223539 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223557 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 223573 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223573 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223582 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223594 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 223595 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223595 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223603 NifU-like domain. This is an alignment of the carboxy-terminal domain. This is the only common region between the NifU protein from nitrogen-fixing bacteria and rhodobacterial species. The biochemical function of NifU is unknown 223626 4-hydroxybenzoyl-CoA thioesterase. This enzyme (EC 3.1.2.23) catalyses the final step in the biosynthesis of 4-hydroxybenzoate from 4-chlorobenzoate in the soil dwelling microbe Pseudomonas CBS-3 223628 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 223635 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 223635 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 223635 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 223635 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 223644 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223649 Protein kinase domain 223650 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 223654 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 223666 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 223666 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 223693 Domain of unknown function 223701 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 223701 RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impl 223706 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 223722 Acyl transferase domain 223723 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 223740 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 223745 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 223770 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 223770 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 223770 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 223772 BED zinc finger 223772 pfam02892, zf-BED, BED zinc finger 223773 pfam02892, zf-BED, BED zinc finger 223773 pfam02892, zf-BED, BED zinc finger 223775 Protein kinase domain 223776 Uncharacterized ACR, YdiU/UPF0061 family 223782 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 223797 Ribosomal protein L31e 223804 Sugar (and other) transporter 223811 Respiratory-chain NADH dehydrogenase, 49 Kd subunit 223814 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 223819 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 223825 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 223825 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 223827 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 223838 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 223838 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 223838 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 223838 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 223843 Homeobox domain 223855 Ribosomal protein L36e 223862 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 223862 Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer 223864 Cyclic nucleotide-binding domain 223864 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 223871 7 transmembrane receptor (rhodopsin family) 223874 7 transmembrane receptor (rhodopsin family) 223876 7 transmembrane receptor (rhodopsin family) 223878 7 transmembrane receptor (rhodopsin family) 223903 Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formatio 223908 Intermediate filament protein 223909 Intermediate filament protein 223910 Intermediate filament protein 223911 Intermediate filament protein 223915 Intermediate filament protein 223917 Intermediate filament protein 223918 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 223928 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 223928 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 223931 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 223932 7 transmembrane receptor (rhodopsin family) 223934 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 223934 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 223981 ENV polyprotein (coat polyprotein) 223995 Cadherin domain 223998 Poly-adenylate binding protein, unique domain 223998 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 224004 ENV polyprotein (coat polyprotein) 224010 Ribosomal L29e protein family 224014 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 224014 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 224014 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 224020 Phosphatidylinositol 3- and 4-kinase 224020 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 224023 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 224023 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 224041 7 transmembrane receptor (rhodopsin family) 224041 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 224041 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 224041 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 224044 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 224045 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 224048 Proteasome A-type and B-type 224054 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 224073 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 224078 DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme cons 224087 TMS membrane protein/tumour differentially expressed protein (TDE) 224087 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 224087 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 224092 GTPase of unknown function 224092 GTPase of unknown function 224092 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 224097 Ribosomal protein L31e 224105 Protein kinase domain 224105 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 224115 von Willebrand factor type D domain 224115 AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges 224119 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 224119 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 224133 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 224134 Ribosomal protein S5, C-terminal domain 224134 Ribosomal protein S5, N-terminal domain 224137 Ribosomal L29e protein family 224139 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 224139 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 224145 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 224147 Proteasome A-type and B-type 224156 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 224161 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 224163 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 224173 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 224173 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 224176 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 224176 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 224185 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 224185 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 224191 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 224201 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 224206 MCM2/3/5 family 224207 ENV polyprotein (coat polyprotein) 224211 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 224218 Protein kinase domain 224219 ENV polyprotein (coat polyprotein) 224224 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 224234 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 224243 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 224244 Ribosomal protein L13e 224250 PMP-22/EMP/MP20/Claudin family 224251 7 transmembrane receptor (rhodopsin family) 224256 7 transmembrane receptor (rhodopsin family) 224260 7 transmembrane receptor (rhodopsin family) 224264 7 transmembrane receptor (rhodopsin family) 224265 7 transmembrane receptor (rhodopsin family) 224270 7 transmembrane receptor (rhodopsin family) 224273 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 224273 QXW lectin repeat 224273 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 224281 NOL1/NOP2/sun family 224344 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 224381 FKBP-type peptidyl-prolyl cis-trans isomerase 224392 Ribosomal protein L21e 224418 Ribosomal protein L31e 224428 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 224432 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 224442 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 224443 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 224445 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 224480 Oxidoreductase FAD-binding domain 224480 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous 224485 Ribosomal protein L34e 224487 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined b 224496 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 224496 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 224496 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 224497 Ribosomal L15 224498 Actin 224503 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 224504 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 224508 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 224508 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 224512 Cyclophilin type peptidyl-prolyl cis-trans isomerase 224532 Golgi 4-transmembrane spanning transporter 224533 Ornithine decarboxylase antizyme 224534 Proteasome A-type and B-type 224543 7 transmembrane receptor (rhodopsin family) 224544 7 transmembrane receptor (metabotropic glutamate family) 224544 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224544 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 224546 Sugar (and other) transporter 224546 7 transmembrane receptor (metabotropic glutamate family) 224546 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224547 7 transmembrane receptor (metabotropic glutamate family) 224548 ENV polyprotein (coat polyprotein) 224548 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 224549 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 224552 ENV polyprotein (coat polyprotein) 224552 7 transmembrane receptor (metabotropic glutamate family) 224552 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224554 L1 transposable element 224564 HMG (high mobility group) box 224564 7 transmembrane receptor (metabotropic glutamate family) 224564 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224569 7 transmembrane receptor (metabotropic glutamate family) 224569 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224572 7 transmembrane receptor (metabotropic glutamate family) 224572 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224573 Sugar (and other) transporter 224573 ENV polyprotein (coat polyprotein) 224573 7 transmembrane receptor (metabotropic glutamate family) 224573 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 224573 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224574 7 transmembrane receptor (rhodopsin family) 224575 7 transmembrane receptor (rhodopsin family) 224576 7 transmembrane receptor (metabotropic glutamate family) 224576 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 224576 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224578 7 transmembrane receptor (metabotropic glutamate family) 224578 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224580 7 transmembrane receptor (metabotropic glutamate family) 224580 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224582 7 transmembrane receptor (metabotropic glutamate family) 224582 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224603 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 224605 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 224607 ENV polyprotein (coat polyprotein) 224607 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 224607 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 224607 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 224607 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 224616 Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein 224627 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 224630 Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway 224650 Phosphotyrosine interaction domain (PTB/PID) 224650 Phosphotyrosine interaction domain (PTB/PID) 224650 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 224671 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 224687 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 224691 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 224691 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 224692 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 224694 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 224697 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 224697 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 224705 Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 224705 Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 224727 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 224742 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 224748 Class I Histocompatibility antigen, domains alpha 1 and 2 224752 Homeobox domain 224753 Class I Histocompatibility antigen, domains alpha 1 and 2 224753 Class I Histocompatibility antigen, domains alpha 1 and 2 224753 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 224753 Class I Histocompatibility antigen, domains alpha 1 and 2 224753 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 224754 Class I Histocompatibility antigen, domains alpha 1 and 2 224754 Class I Histocompatibility antigen, domains alpha 1 and 2 224756 Class I Histocompatibility antigen, domains alpha 1 and 2 224756 Class I Histocompatibility antigen, domains alpha 1 and 2 224756 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 224756 Class I Histocompatibility antigen, domains alpha 1 and 2 224756 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 224758 ENV polyprotein (coat polyprotein) 224758 Class I Histocompatibility antigen, domains alpha 1 and 2 224759 Class I Histocompatibility antigen, domains alpha 1 and 2 224761 Class I Histocompatibility antigen, domains alpha 1 and 2 224761 Class I Histocompatibility antigen, domains alpha 1 and 2 224761 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 224761 Class I Histocompatibility antigen, domains alpha 1 and 2 224761 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 224763 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 224763 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 224764 7 transmembrane receptor (rhodopsin family) 224765 7 transmembrane receptor (rhodopsin family) 224766 7 transmembrane receptor (rhodopsin family) 224767 7 transmembrane receptor (rhodopsin family) 224768 7 transmembrane receptor (rhodopsin family) 224769 L1 transposable element 224769 Class I Histocompatibility antigen, domains alpha 1 and 2 224769 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 224771 7 transmembrane receptor (rhodopsin family) 224773 7 transmembrane receptor (rhodopsin family) 224776 7 transmembrane receptor (rhodopsin family) 224781 ENV polyprotein (coat polyprotein) 224782 Protein kinase domain 224792 7 transmembrane receptor (Secretin family) 224792 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 224794 Sulfatase 224794 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 224805 tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases 224807 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 224813 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 224836 Ubiquitin carboxyl-terminal hydrolase family 2 224847 ENV polyprotein (coat polyprotein) 224860 C2 domain 224860 PH domain. PH stands for pleckstrin homology 224860 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 224860 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 224870 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 224872 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 224872 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 224875 Protein kinase domain 224876 Protein kinase domain 224883 P53 224893 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 224894 Ribosomal protein L21e 224902 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 224903 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 224910 Adenylate kinase 224910 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 224914 Zinc finger, C3HC4 type (RING finger) 224916 7 transmembrane receptor (metabotropic glutamate family) 224916 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 224920 Protein kinase domain 224923 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 224923 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 224933 Cyclophilin type peptidyl-prolyl cis-trans isomerase 224949 Ribosomal protein S6e 224954 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 224955 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 224997 Guanylate-kinase-associated protein (GKAP) protein 225010 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 225013 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 225013 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 225014 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 225019 Ribosomal protein L13e 225021 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 225022 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 225028 Protein kinase domain 225028 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 225029 Ribosomal protein S2 225030 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 225030 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 225033 Ribosomal protein L31e 225046 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 225055 Putative zinc finger in N-recognin 225058 Ribosomal protein S24e 225065 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 225077 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 225077 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 225077 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 225077 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 225093 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 225095 ENV polyprotein (coat polyprotein) 225096 Actin 225099 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225099 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225101 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 225103 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 225104 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 225104 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 225104 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 225105 Cyclophilin type peptidyl-prolyl cis-trans isomerase 225107 Ribosomal protein L3 225114 Cadherin domain 225115 Villin headpiece domain 225118 Eukaryotic protein of unknown function, DUF279 225127 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 225127 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 225127 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 225129 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 225129 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 225129 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 225134 Ribosomal family S4e 225139 SH2 domain 225141 Ribosomal protein S21e 225141 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 225149 ENV polyprotein (coat polyprotein) 225151 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 225151 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 225151 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 225151 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 225151 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 225152 Connexin 225154 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 225154 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 225156 ENV polyprotein (coat polyprotein) 225160 Death domain 225187 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 225187 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 225192 7 transmembrane receptor (rhodopsin family) 225195 Ribosomal protein S2 225201 Ribosomal L29e protein family 225215 Ribosomal protein L24e 225225 Fibrillarin 225227 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225234 Ribosomal L10 225256 Cadherin domain 225256 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 225257 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 225264 Pyruvate kinase, alpha/beta domain 225264 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 225264 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 225266 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 225273 Ribosomal protein S2 225277 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225277 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225307 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 225314 Macrophage migration inhibitory factor (MIF) 225317 Ribosomal protein S8e 225326 Phosphatidylinositol 3- and 4-kinase 225326 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 225326 Phosphatidylinositol 3- and 4-kinase 225326 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 225326 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 225337 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 225339 Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function 225341 LIM domain. This family represents two copies of the LIM structural domain 225343 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 225348 Utp21 specific WD40 associated domain. Utp21 is a subunit of U3 snoRNP, which is essential for synthesis of 18S rRNA 225350 Glycoprotease family 225353 HMG (high mobility group) box 225362 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 225367 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 225367 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 225372 Phosphotyrosine interaction domain (PTB/PID) 225387 L1 transposable element 225387 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 225399 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 225416 Ribosomal protein S19e 225432 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 225442 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 225445 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 225446 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 225470 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 225482 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225482 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225495 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225495 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225526 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225526 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225527 HMG (high mobility group) box 225543 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225550 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225550 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225552 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225552 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225559 Glutamine synthetase, catalytic domain 225559 Glutamine synthetase, beta-Grasp domain 225579 AMP-binding enzyme 225586 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 225586 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 225600 3'5'-cyclic nucleotide phosphodiesterase 225620 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 225620 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 225631 Homeobox domain 225636 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 225636 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 225638 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b 225681 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225681 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225728 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 225742 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 225742 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 225742 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 225743 Zinc finger, C3HC4 type (RING finger) 225761 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225761 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225763 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 225799 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 225799 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 225805 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 225807 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 225812 SKIP/SNW domain. This domain is found in chromatin proteins 225814 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 225817 Cyclophilin type peptidyl-prolyl cis-trans isomerase 225822 Ribosomal L29e protein family 225826 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 225826 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 225827 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 225845 NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins 225849 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 225865 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 225870 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 225875 Fibronectin type III domain 225875 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 225876 F-box domain 225876 jmjC domain 225876 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 225898 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes 225903 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 225912 Cytochrome b561 225919 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 225924 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 225926 7 transmembrane receptor (rhodopsin family) 225932 7 transmembrane receptor (rhodopsin family) 225934 7 transmembrane receptor (rhodopsin family) 225935 7 transmembrane receptor (rhodopsin family) 225936 7 transmembrane receptor (rhodopsin family) 225939 7 transmembrane receptor (rhodopsin family) 225940 7 transmembrane receptor (rhodopsin family) 225941 7 transmembrane receptor (rhodopsin family) 225943 Trypsin 225951 7 transmembrane receptor (rhodopsin family) 225956 7 transmembrane receptor (rhodopsin family) 225957 ENV polyprotein (coat polyprotein) 225957 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 225958 Protein kinase domain 225959 7 transmembrane receptor (rhodopsin family) 225961 7 transmembrane receptor (rhodopsin family) 225962 7 transmembrane receptor (rhodopsin family) 225963 7 transmembrane receptor (rhodopsin family) 225970 7 transmembrane receptor (rhodopsin family) 225971 7 transmembrane receptor (rhodopsin family) 225972 7 transmembrane receptor (rhodopsin family) 225974 Ribosomal protein L13e 225997 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a 225997 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 225998 metallopeptidase family M24 225998 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 226017 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 226017 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 226025 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 226025 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 226030 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 226030 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 226036 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 226041 Phosphoglucomutase/phosphomannomutase, C-terminal domain 226041 Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I 226041 Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II 226041 pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I 226041 pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II 226041 pfam02880, PGM_PMM_III, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain III 226043 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 226047 ADP-ribosylation factor family 226047 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 226049 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 226063 Ribosomal protein L36e 226081 Dynein light chain type 1 226082 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 226086 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 226097 Proteasome A-type and B-type 226099 Exocyst complex subunit Sec15-like 226100 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 226101 C2 domain 226101 C2 domain 226101 C2 domain 226102 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 226105 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 226106 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 226108 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 226118 Adenovirus minor core protein PV 226119 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 226121 Ribosomal protein L44 226122 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 226123 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 226123 Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are 226143 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 226144 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 226160 Ribosomal protein L6, N-terminal domain 226180 Intermediate filament protein 226180 Intermediate filament protein 226182 WD domain, G-beta repeat 226200 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 226201 Ribosomal protein L13 226212 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 226248 Macrophage migration inhibitory factor (MIF) 226250 PH domain. PH stands for pleckstrin homology 226251 LIM domain. This family represents two copies of the LIM structural domain 226251 Villin headpiece domain 226251 LIM domain. This family represents two copies of the LIM structural domain 226265 Enolase, C-terminal TIM barrel domain 226273 Fibronectin type III domain 226278 7 transmembrane receptor (rhodopsin family) 226292 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 226294 Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site 226294 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 226304 7 transmembrane receptor (rhodopsin family) 226310 Ribosomal protein S19 226313 S-adenosyl-L-homocysteine hydrolase 226317 Ribosomal protein S5, C-terminal domain 226322 ENV polyprotein (coat polyprotein) 226325 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 226326 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 226335 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 226335 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 226336 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 226338 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 226343 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 226344 WD domain, G-beta repeat 226348 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 226348 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 226349 Ribosomal S3Ae family 226352 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 226359 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 226376 GDP dissociation inhibitor 226387 NAC domain 226393 7 transmembrane receptor (rhodopsin family) 226409 HNH endonuclease 226409 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 226409 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 226413 Glycosyl hydrolase family 1 226416 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 226421 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 226432 Importin-beta N-terminal domain 226438 Fibronectin type III domain 226439 Helix-loop-helix DNA-binding domain 226443 Porphobilinogen deaminase, C-terminal domain 226443 Porphobilinogen deaminase, dipyromethane cofactor binding domain 226462 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 226472 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 226472 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 226473 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 226473 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 226475 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 226478 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 226479 Core histone H2A/H2B/H3/H4 226491 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 226494 Hsp90 protein 226496 Ribosomal protein S8e 226503 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 226513 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 226519 Laminin B (Domain IV) 226519 Laminin N-terminal (Domain VI) 226519 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 226519 Laminin B (Domain IV) 226519 Laminin N-terminal (Domain VI) 226519 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 226519 Methyl-accepting chemotaxis protein (MCP) signaling domain. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs 226519 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 226519 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 226523 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 226527 Zinc-binding dehydrogenase 226541 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 226554 Dynamin family 226554 PH domain. PH stands for pleckstrin homology 226561 Eukaryotic initiation factor 4E 226564 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 226565 Thi4 family. This family includes a putative thiamine biosynthetic enzyme 226565 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 226565 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 226565 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 226591 TIP41-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 interacts with TAP42 and negatively regulates the TOR signaling pathway 226601 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 226601 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 226602 ENV polyprotein (coat polyprotein) 226602 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 226602 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 226604 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 226604 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 226640 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 226641 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 226641 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 226643 Ribosomal protein S24e 226646 Respiratory-chain NADH dehydrogenase, 49 Kd subunit 226654 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 226673 Protein-tyrosine phosphatase 226673 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 226685 Pentaxin family. Pentaxins are also known as pentraxins 226689 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 226689 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 226690 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 226691 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 226695 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 226695 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 226702 7 transmembrane receptor (rhodopsin family) 226705 7 transmembrane receptor (rhodopsin family) 226710 7 transmembrane receptor (rhodopsin family) 226711 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 226720 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 226723 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 226732 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 226733 Ribosomal protein L44 226735 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 226740 Ribosomal L29e protein family 226745 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 226745 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 226745 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 226751 Uncharacterized ACR, COG1579 226751 Intermediate filament protein 226751 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 226751 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 226751 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 226751 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 226778 Kinase associated domain 1 226781 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 226783 Homeobox domain 226786 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 226786 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 226791 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 226822 Spumavirus gag protein 226823 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 226830 MYND finger 226849 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 226856 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 226859 Cyclin-dependent kinase regulatory subunit 226861 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 226866 Recombination protein O. Recombination protein O (RecO) is involved in DNA repair and pfam00470 pathway recombination 226871 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 226874 NUDIX domain 226875 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 226877 HMG (high mobility group) box 226878 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 226878 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 226882 Cyclophilin type peptidyl-prolyl cis-trans isomerase 226884 ENV polyprotein (coat polyprotein) 226884 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 226884 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 226884 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 226884 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 226885 HMG (high mobility group) box 226896 Transcription factor AP-2 226922 KCNQ1 voltage-gated potassium channel 226922 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 226922 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 226922 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 226928 C2 domain 226936 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 226936 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 226948 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 226948 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 226949 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 226950 Elongation factor 1 gamma, conserved domain 226950 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 226950 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 226952 Protein kinase domain 226952 Uncharacterized protein family (ORF7) DUF. Several members of this family are Borrelia burgdorferi plasmid proteins of uncharacterized function 226954 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 226955 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 226956 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 226970 PH domain. PH stands for pleckstrin homology 226970 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 226970 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 226982 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 226982 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 226985 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 226999 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 227000 Cyclophilin type peptidyl-prolyl cis-trans isomerase 227035 ADP-ribosylation factor family 227036 ADP-ribosylation factor family 227036 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 227036 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 227037 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 227038 FKBP-type peptidyl-prolyl cis-trans isomerase 227042 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 227044 'chromo' (CHRromatin Organization MOdifier) domain 227044 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 227044 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 227045 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 227054 Ribosomal protein L23 227054 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 227055 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 227056 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 227058 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 227059 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 227061 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 227078 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 227089 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 227089 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227090 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 227090 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 227092 HMG (high mobility group) box 227102 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 227114 NAC domain 227114 Orbivirus outer capsid protein VP2. VP2 acts as an anchor for VP1 and VP3. VP2 contains a non-specific DNA and RNA binding domain in the N-terminus 227114 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 227119 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 227120 C2 domain 227120 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 227134 Homeobox domain 227150 Ribosomal protein L31e 227157 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 227231 D-ala D-ala ligase. This family contains D-alanine--D-alanine ligase enzymes EC:6.3.2.4 227231 Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosyn 227231 Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosy 227231 Carbamoyl-phosphate synthetase large chain, oligomerisation domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. The carbamoyl-phosphate synthase (CPS) enzyme in prokar 227231 MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also i 227234 Kinesin motor domain 227237 Ribosomal protein L21e 227242 Ribosomal protein L21e 227261 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 227261 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 227266 Ribosomal protein L21e 227288 7 transmembrane receptor (rhodopsin family) 227288 7 transmembrane receptor (rhodopsin family) 227289 7 transmembrane receptor (rhodopsin family) 227292 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 227292 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 227309 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 227318 L1 transposable element 227320 L1 transposable element 227326 7 transmembrane receptor (rhodopsin family) 227327 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 227333 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 227333 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 227333 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 227333 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 227334 Ubiquitin carboxyl-terminal hydrolase family 2 227334 Ubiquitin carboxyl-terminal hydrolases family 2 227335 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 227336 UDP-glucoronosyl and UDP-glucosyl transferase 227354 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 227354 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 227354 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 227354 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 227367 7 transmembrane receptor (rhodopsin family) 227377 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 227384 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 227388 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 227393 Dienelactone hydrolase family 227393 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 227394 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 227394 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 227396 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 227397 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 227399 Histidine acid phosphatase 227403 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 227403 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227407 ENV polyprotein (coat polyprotein) 227411 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 227411 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 227411 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 227412 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 227416 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 227421 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 227425 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 227435 Domain of unknown function 227435 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 227435 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 227435 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 227449 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 227458 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 227460 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 227461 Ribosomal protein S6e 227462 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 227485 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 227489 CCR4-Not complex component, Not1. The Ccr4-Not complex is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID 227513 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 227513 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 227517 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 227517 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 227517 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 227522 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 227532 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 227559 Ribosomal protein L23 227564 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 227566 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227567 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 227567 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227570 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 227570 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 227572 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 227572 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 227588 Eukaryotic initiation factor 1A 227599 AMP-binding enzyme 227606 Transcription factor TFIID (or TATA-binding protein, TBP) 227619 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 227620 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid 227624 ADP-ribosylation factor family 227624 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 227627 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 227630 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 227632 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 227638 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 227656 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 227659 Sugar (and other) transporter 227671 Glycosyltransferase family 6 227677 Homeobox domain 227729 Ribosomal L15 227731 Mitochondrial carrier protein 227733 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 227736 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 227744 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 227746 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 227757 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 227770 7 transmembrane receptor (rhodopsin family) 227772 7 transmembrane receptor (rhodopsin family) 227775 7 transmembrane receptor (rhodopsin family) 227777 Eukaryotic ribosomal protein L18 227778 7 transmembrane receptor (rhodopsin family) 227780 7 transmembrane receptor (rhodopsin family) 227785 NAC domain 227786 7 transmembrane receptor (rhodopsin family) 227787 7 transmembrane receptor (rhodopsin family) 227788 7 transmembrane receptor (rhodopsin family) 227789 7 transmembrane receptor (rhodopsin family) 227794 7 transmembrane receptor (rhodopsin family) 227796 7 transmembrane receptor (rhodopsin family) 227800 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary 227801 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 227802 7 transmembrane receptor (Secretin family) 227802 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 227827 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 227827 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227847 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 227847 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 227852 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 227859 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 227859 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227885 Hsp90 protein 227888 Inorganic pyrophosphatase 227906 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227920 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 227920 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227937 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 227940 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 227940 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 227945 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 227968 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 227968 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 227974 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 227974 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 227989 14-3-3 protein 227992 Ribosomal protein L21e 228003 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 228005 Cyclophilin type peptidyl-prolyl cis-trans isomerase 228026 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 228033 ATP synthase subunit C 228065 GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases 228123 Ribosomal family S4e 228124 Cyclophilin type peptidyl-prolyl cis-trans isomerase 228126 Enolase, C-terminal TIM barrel domain 228139 ATP P2X receptor 228145 7 transmembrane receptor (rhodopsin family) 228151 Fatty acid desaturase 228151 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 228152 7 transmembrane receptor (rhodopsin family) 228153 7 transmembrane receptor (rhodopsin family) 228157 7 transmembrane receptor (rhodopsin family) 228161 7 transmembrane receptor (rhodopsin family) 228166 7 transmembrane receptor (rhodopsin family) 228176 7 transmembrane receptor (rhodopsin family) 228178 7 transmembrane receptor (rhodopsin family) 228184 7 transmembrane receptor (rhodopsin family) 228185 7 transmembrane receptor (rhodopsin family) 228186 7 transmembrane receptor (rhodopsin family) 228189 ENV polyprotein (coat polyprotein) 228189 7 transmembrane receptor (rhodopsin family) 228189 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 228196 7 transmembrane receptor (rhodopsin family) 228196 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 228199 Dynamin family 228199 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 228204 7 transmembrane receptor (rhodopsin family) 228206 7 transmembrane receptor (rhodopsin family) 228209 7 transmembrane receptor (rhodopsin family) 228210 7 transmembrane receptor (rhodopsin family) 228212 7 transmembrane receptor (rhodopsin family) 228213 7 transmembrane receptor (rhodopsin family) 228218 7 transmembrane receptor (rhodopsin family) 228228 7 transmembrane receptor (rhodopsin family) 228235 ADP-ribosylation factor family 228237 Protein kinase domain 228238 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 228239 7 transmembrane receptor (rhodopsin family) 228243 7 transmembrane receptor (rhodopsin family) 228252 7 transmembrane receptor (rhodopsin family) 228253 7 transmembrane receptor (rhodopsin family) 228256 7 transmembrane receptor (rhodopsin family) 228260 7 transmembrane receptor (rhodopsin family) 228263 7 transmembrane receptor (rhodopsin family) 228269 7 transmembrane receptor (rhodopsin family) 228271 7 transmembrane receptor (rhodopsin family) 228272 7 transmembrane receptor (rhodopsin family) 228273 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 228274 7 transmembrane receptor (rhodopsin family) 228275 7 transmembrane receptor (rhodopsin family) 228276 7 transmembrane receptor (rhodopsin family) 228278 7 transmembrane receptor (rhodopsin family) 228279 7 transmembrane receptor (rhodopsin family) 228281 7 transmembrane receptor (rhodopsin family) 228282 7 transmembrane receptor (rhodopsin family) 228284 7 transmembrane receptor (rhodopsin family) 228285 7 transmembrane receptor (rhodopsin family) 228287 7 transmembrane receptor (rhodopsin family) 228288 7 transmembrane receptor (rhodopsin family) 228292 7 transmembrane receptor (rhodopsin family) 228294 7 transmembrane receptor (rhodopsin family) 228295 7 transmembrane receptor (rhodopsin family) 228298 7 transmembrane receptor (rhodopsin family) 228299 7 transmembrane receptor (rhodopsin family) 228300 7 transmembrane receptor (rhodopsin family) 228304 7 transmembrane receptor (rhodopsin family) 228307 7 transmembrane receptor (rhodopsin family) 228308 7 transmembrane receptor (rhodopsin family) 228311 7 transmembrane receptor (rhodopsin family) 228318 7 transmembrane receptor (rhodopsin family) 228319 7 transmembrane receptor (rhodopsin family) 228325 7 transmembrane receptor (rhodopsin family) 228337 7 transmembrane receptor (rhodopsin family) 228345 7 transmembrane receptor (rhodopsin family) 228355 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 228355 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 228355 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 228355 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 228355 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 228355 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 228357 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 228364 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 228364 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 228366 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl 228377 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 228382 Mitochondrial carrier protein 228408 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 228408 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 228413 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 228418 Mitochondrial carrier protein 228419 S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure 228421 Kinesin motor domain 228430 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 228435 7 transmembrane receptor (rhodopsin family) 228437 7 transmembrane receptor (rhodopsin family) 228438 7 transmembrane receptor (rhodopsin family) 228439 7 transmembrane receptor (rhodopsin family) 228441 7 transmembrane receptor (rhodopsin family) 228442 7 transmembrane receptor (rhodopsin family) 228443 7 transmembrane receptor (rhodopsin family) 228447 7 transmembrane receptor (rhodopsin family) 228448 7 transmembrane receptor (rhodopsin family) 228449 7 transmembrane receptor (rhodopsin family) 228454 7 transmembrane receptor (rhodopsin family) 228464 7 transmembrane receptor (rhodopsin family) 228465 7 transmembrane receptor (rhodopsin family) 228466 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 228466 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 228470 7 transmembrane receptor (rhodopsin family) 228475 7 transmembrane receptor (rhodopsin family) 228482 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 228507 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 228515 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 228549 Core histone H2A/H2B/H3/H4 228550 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 228564 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 228569 SH2 domain 228598 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 228602 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 228604 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 228608 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 228662 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 228666 ENV polyprotein (coat polyprotein) 228692 Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi 228710 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 228712 Ribosomal protein S5, C-terminal domain 228712 Ribosomal protein S5, N-terminal domain 228714 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 228714 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 228731 Homeobox domain 228731 Homeobox domain 228732 Cyclophilin type peptidyl-prolyl cis-trans isomerase 228756 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 228756 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 228776 Ribosomal protein L10 228776 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 228785 Protein kinase domain 228785 Phosphatidylinositol 3- and 4-kinase 228796 LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 228796 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 228798 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 228804 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 228809 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 228814 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 228836 Guanylate-kinase-associated protein (GKAP) protein 228861 WAP-type (Whey Acidic Protein) 'four-disulfide core' 228861 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 228869 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 228876 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 228880 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 228880 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 228893 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 228942 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 228960 Syntaxin 228961 Sec63 domain 228961 Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase 228966 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 228983 Oxysterol-binding protein 228993 Sugar (and other) transporter 228994 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 228994 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f 228998 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 229003 UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t 229004 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 229005 Protein kinase domain 229013 7 transmembrane receptor (rhodopsin family) 229017 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 229027 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 229038 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 229043 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 229051 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229051 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229055 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 229055 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 229064 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 229069 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 229070 Phosphoglycerate mutase family 229070 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 229076 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229076 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229087 HMG (high mobility group) box 229089 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229089 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229090 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229091 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229091 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229095 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 229096 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 229098 ENV polyprotein (coat polyprotein) 229105 Hsp90 protein 229109 Phosphoglycerate kinase 229121 Cyclophilin type peptidyl-prolyl cis-trans isomerase 229142 ENV polyprotein (coat polyprotein) 229152 Cullin family 229176 Ribosomal protein S6e 229189 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 229206 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 229207 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 229209 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229209 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229219 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229228 NUDIX domain 229228 NUDIX domain 229228 NUDIX domain 229269 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 229270 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 229272 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229272 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229274 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 229276 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229277 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 229278 Calx-beta domain 229280 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 229285 MIT domain 229309 Phosphoglycerate kinase 229323 7 transmembrane receptor (rhodopsin family) 229330 Carboxylesterase 229330 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 229333 Carboxylesterase 229333 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 229337 Carboxylesterase 229350 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229352 Hsp90 protein 229354 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 229354 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 229363 Glutamine amidotransferase class-I 229363 tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs 229363 GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. E 229366 7 transmembrane receptor (metabotropic glutamate family) 229367 AMP-binding enzyme 229369 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 229371 Ribosomal protein S6e 229373 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 229384 Ribosomal protein L35Ae 229389 Ribosomal protein S6e 229389 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 229389 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 229391 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229413 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 229418 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 229419 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 229425 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229425 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229458 Cadherin domain 229459 Cadherin domain 229468 Ribosomal L28e protein family 229476 Tc5 transposase 229476 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di 229476 CENP-B protein. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding domain, which binds to a corresponding 17bp CENP-B box sequence. In the C-terminal 59 residues, 229476 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 229482 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229488 Androgen receptor 229500 Ets-domain 229504 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 229505 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 229506 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 229517 Mitochondrial carrier protein 229517 Mitochondrial carrier protein 229521 C2 domain 229539 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 229541 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 229541 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 229544 Ribosomal protein S5, C-terminal domain 229544 Ribosomal protein S5, N-terminal domain 229546 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 229547 PX domain. PX domains bind to phosphoinositides 229553 Macrophage migration inhibitory factor (MIF) 229560 Ribosomal protein L21e 229563 Ribosomal protein L13e 229569 Homeobox domain 229569 Integral membrane protein DUF110. This archaebacterial protein family has no known function. Some members of this family are annotated as FlaJ, however we can find no supporting evidence for this annotation 229570 14-3-3 protein 229571 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 229580 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 229584 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 229589 DHHA2 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA2 for DHH associated domain. This domain is diagnostic of DHH subfamily 2 members. The domain is about 120 residues long and contains a conserved DXK motif at 229607 Core histone H2A/H2B/H3/H4 229608 Core histone H2A/H2B/H3/H4 229608 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 229615 pfam02891, zf-MIZ, MIZ zinc finger 229615 pfam02891, zf-MIZ, MIZ zinc finger 229622 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 229626 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 229631 14-3-3 protein 229636 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 229639 L1 transposable element 229644 B-box zinc finger 229644 Filamin/ABP280 repeat 229644 B-box zinc finger 229644 Filamin/ABP280 repeat 229663 'Cold-shock' DNA-binding domain 229665 Adenosine/AMP deaminase 229681 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 229688 Glycosyl hydrolases family 18 229691 7 transmembrane receptor (rhodopsin family) 229697 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic 229699 Sugar (and other) transporter 229700 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 229706 Sodium:neurotransmitter symporter family 229709 S-adenosyl-L-homocysteine hydrolase 229710 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 229710 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 229714 7 transmembrane receptor (rhodopsin family) 229729 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 229729 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 229731 Mitochondrial carrier protein 229739 Conserved hypothetical protein 95 229739 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 229740 Ribosomal protein L34e 229746 Ribosomal family S4e 229751 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwee 229751 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str 229752 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwee 229752 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str 229756 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 229756 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 229772 Ets-domain 229776 Protein-tyrosine phosphatase 229776 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 229782 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 229785 Ribosomal protein S19e 229785 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 229791 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 229791 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 229799 Ribosomal protein L14p/L23e 229803 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229803 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229810 Enolase, C-terminal TIM barrel domain 229824 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 229837 Clathrin adaptor complex small chain 229837 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 229841 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 229841 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 229841 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 229851 Hsp90 protein 229851 Ribosomal L29e protein family 229857 Ribosomal protein L44 229869 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 229871 Zinc-binding dehydrogenase 229871 Prephenate dehydrogenase. Members of this family are prephenate dehydrogenases EC:1.3.1.12 involved in tyrosine biosynthesis 229879 Nucleoside diphosphate kinase 229893 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 229898 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 229898 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 229900 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 229900 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 229902 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 229902 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 229905 Aminotransferase class I and II 229905 DegT/DnrJ/EryC1/StrS aminotransferase family. The members of this family are probably all pyridoxal-phosphate-dependent aminotransferase enzymes with a variety of molecular functions. The family includes StsA, StsC and StsS. The aminotransferase activity 229905 Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OAH 229910 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 229910 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 229920 Ribosomal protein L21e 229921 Ribosomal protein L6 229949 Adenylate kinase 229958 Protein-tyrosine phosphatase 229958 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 229979 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 229992 PWI domain 230002 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 230002 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 230002 Glucuronate isomerase. This is a family of Glucuronate isomerases also known as D-glucuronate isomerase, uronic isomerase, uronate isomerase, or uronic acid isomerase, EC:5.3.1.12. This enzyme catalyses the reactions: D-glucuronate <=> D-fructuronate and 230003 Ribosomal protein S17 230006 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 230010 Poly-adenylate binding protein, unique domain 230010 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 230013 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 230013 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 230025 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 230042 Ribosomal protein S6e 230043 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 230044 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 230044 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 230045 Hsp90 protein 230045 Macrophage migration inhibitory factor (MIF) 230046 Ribosomal protein S6e 230054 HMG (high mobility group) box 230069 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 230082 Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap 230098 PH domain. PH stands for pleckstrin homology 230098 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 230099 Eukaryotic-type carbonic anhydrase 230103 Protein kinase domain 230103 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 230103 Protein kinase domain 230103 Adenylate and Guanylate cyclase catalytic domain 230103 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 230119 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 230125 Mitochondrial carrier protein 230126 SH2 domain 230145 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 230163 Fructose-bisphosphate aldolase class-I 230173 ENV polyprotein (coat polyprotein) 230174 Elongation factor 1 gamma, conserved domain 230175 Ribosomal protein L13e 230185 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 230188 7 transmembrane receptor (rhodopsin family) 230195 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 230197 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 230220 Core histone H2A/H2B/H3/H4 230234 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 230235 DOMON domain. The DOMON (named after dopamine beta-monooxygenase N-terminal) domain is 110-125 residues long. It is predicted to form an all beta fold with 7-8 strands. The domain may mediate extracellular adhesive interactions 230251 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 230251 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 230251 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 230253 Ribosomal protein S2 230257 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 230260 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 230260 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 230280 TNF(Tumor Necrosis Factor) family 230287 Enolase, C-terminal TIM barrel domain 230301 Ribosomal protein S5, N-terminal domain 230310 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 230316 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 230316 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 230337 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 230337 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 230341 Ribosomal protein L3 230344 Calx-beta domain 230346 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 230347 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 230347 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 230358 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 230358 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 230367 Ribosomal RNA adenine dimethylase 230396 Interferon alpha/beta domain 230396 Interferon alpha/beta domain 230397 Interferon alpha/beta domain 230398 Interferon alpha/beta domain 230401 Interferon alpha/beta domain 230405 Interferon alpha/beta domain 230405 Interferon alpha/beta domain 230411 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 230419 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 230419 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 230431 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 230431 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 230437 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 230437 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 230448 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 230459 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 230460 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 230461 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 230465 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 230466 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 230474 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 230483 Homeobox domain 230493 Ubiquitin carboxyl-terminal hydrolase, family 1 230494 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 230494 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 230514 Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei 230558 Low-density lipoprotein receptor domain class A 230558 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb 230562 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 230582 Oxidoreductase FAD-binding domain 230582 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 230582 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous 230592 Ribosomal protein L6 230594 PAP/25A associated domain 230594 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 230596 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou 230612 Sodium:solute symporter family 230616 Ribosomal protein L3 230621 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 230621 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 230638 Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found 230638 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 230639 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 230661 Protein kinase domain 230664 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 230664 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 230674 jmjN domain 230674 jmjC domain 230684 7 transmembrane receptor (rhodopsin family) 230690 7 transmembrane receptor (rhodopsin family) 230692 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 230692 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 230709 Peptidase family M48 230710 Palmitoyl protein thioesterase 230714 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 230715 Palmitoyl protein thioesterase 230715 ENV polyprotein (coat polyprotein) 230721 Poly-adenylate binding protein, unique domain 230724 Ribosomal protein S5, C-terminal domain 230724 Ribosomal protein S5, N-terminal domain 230727 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 230727 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. These proteins contain three strongly conserved histidines in the putative trans 230735 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 230735 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 230737 GTPase of unknown function 230758 Ribosomal protein S5, C-terminal domain 230758 Ribosomal protein S5, N-terminal domain 230760 CUB domain 230760 Sushi domain (SCR repeat) 230761 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 230765 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 230771 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 230775 7 transmembrane receptor (Secretin family) 230775 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 230775 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 230775 Thrombospondin type 1 domain 230775 7 transmembrane receptor (rhodopsin family) 230775 7 transmembrane receptor (Secretin family) 230775 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 230775 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 230777 7 transmembrane receptor (rhodopsin family) 230779 TMS membrane protein/tumour differentially expressed protein (TDE) 230796 TPR Domain 230796 WD domain, G-beta repeat 230799 14-3-3 protein 230800 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 230801 Protein of unknown function (DUF409). Family of eukaryotic membrane proteins with unknown function 230801 Protein of unknown function (DUF409). Family of eukaryotic membrane proteins with unknown function 230805 'Cold-shock' DNA-binding domain 230806 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 230810 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 230815 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 230827 Proteasome A-type and B-type 230828 Tissue factor 230837 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 230837 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 230837 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 230843 MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p 230843 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 230857 Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be 230861 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA 230887 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 230890 Carboxylesterase 230891 Carboxylesterase 230896 Ribosomal L29e protein family 230899 Atrial natriuretic peptide 230903 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate 230904 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated 230936 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 230972 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 230972 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 231002 PH domain. PH stands for pleckstrin homology 231003 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 231003 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 231006 ENV polyprotein (coat polyprotein) 231025 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 231025 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 231025 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 231031 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 231032 Cadherin domain 231032 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 231033 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 231037 Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto 231044 Homeobox domain 231045 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 231047 ENV polyprotein (coat polyprotein) 231050 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 231051 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 231051 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 231059 AMP-binding enzyme 231075 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 231075 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 231078 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 231081 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 231086 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 231098 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 231109 LIM domain. This family represents two copies of the LIM structural domain 231109 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 231118 Homeobox domain 231125 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 231126 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 231134 PTB domain (IRS-1 type) 231143 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 231148 Villin headpiece domain 231148 LIM domain. This family represents two copies of the LIM structural domain 231148 Villin headpiece domain 231151 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 231157 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 231178 Mitochondrial carrier protein 231193 Ribosomal L29e protein family 231207 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 231220 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 231227 ADP-ribosylation factor family 231227 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 231234 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 231237 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 231239 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 231241 PAP/25A associated domain 231242 eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues 231248 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 231248 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 231251 Smr domain. This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 protein in the case of Smr proteins and with the N-terminal MutS relate 231252 Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 231252 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 231252 Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 231252 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 231269 Ribosomal protein L21e 231294 Ribosomal protein S5, C-terminal domain 231294 Ribosomal protein S5, N-terminal domain 231317 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 231317 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 231326 AMP-binding enzyme 231326 Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix b 231326 AMP-binding enzyme 231329 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com 231332 Death effector domain 231343 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 231343 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 231380 Repeat in ubiquitin-activating (UBA) protein 231380 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 231381 Trypsin 231382 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 231386 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 231386 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f 231387 UDP-glucoronosyl and UDP-glucosyl transferase 231391 UDP-glucoronosyl and UDP-glucosyl transferase 231392 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 231393 UDP-glucoronosyl and UDP-glucosyl transferase 231394 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 231396 UDP-glucoronosyl and UDP-glucosyl transferase 231396 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 231397 UDP-glucoronosyl and UDP-glucosyl transferase 231398 UDP-glucoronosyl and UDP-glucosyl transferase 231407 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 231408 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 231411 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 231413 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 231430 60Kd inner membrane protein 231444 Eukaryotic porin 231446 Cyclophilin type peptidyl-prolyl cis-trans isomerase 231462 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 231467 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 231470 Calx-beta domain 231470 von Willebrand factor type A domain 231470 Giardia variant-specific surface protein 231470 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 231470 Calx-beta domain 231474 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 231484 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 231484 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 231501 Ribosomal L29e protein family 231506 Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in 231547 ENV polyprotein (coat polyprotein) 231552 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 231552 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 231558 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 231563 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 231563 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 231565 Zona pellucida-like domain 231571 Domain of Unknown Function (DUF408) 231573 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 231580 Protein kinase domain 231580 Protein-tyrosine phosphatase 231580 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 231583 Sulfate transporter family. Mutations may lead to several human diseases 231586 7 transmembrane receptor (metabotropic glutamate family) 231586 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 231586 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 231587 7 transmembrane receptor (metabotropic glutamate family) 231588 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 231589 7 transmembrane receptor (metabotropic glutamate family) 231589 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 231591 7 transmembrane receptor (metabotropic glutamate family) 231591 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 231594 Actin 231600 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 231602 ATP P2X receptor 231605 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 231613 Uncharacterized ACR, COG1579 231613 Methyl-accepting chemotaxis protein (MCP) signaling domain. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs 231613 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E 231613 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 231613 IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation 231613 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 231613 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 231613 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 231655 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 231663 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 231663 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 231666 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 231707 Cyclophilin type peptidyl-prolyl cis-trans isomerase 231715 Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold 231715 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 231717 PH domain. PH stands for pleckstrin homology 231724 Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage 231727 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 231736 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 231760 Fibronectin type III domain 231760 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 231769 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 231772 Ribosomal protein S5, C-terminal domain 231772 Ribosomal protein S5, N-terminal domain 231802 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 231803 O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine 231810 Aminotransferase class I and II 231812 7 transmembrane receptor (metabotropic glutamate family) 231812 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 231821 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 231822 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 231830 LIM domain. This family represents two copies of the LIM structural domain 231832 Domain of unknown function 231834 PX domain. PX domains bind to phosphoinositides 231836 Kinesin motor domain 231837 Core histone H2A/H2B/H3/H4 231858 DIL domain. The DIL domain has no known function 231858 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 231863 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 231864 7 transmembrane receptor (rhodopsin family) 231866 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 231869 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 231869 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 231889 G10 protein 231905 Ribosomal L18ae protein family 231914 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 231923 7 transmembrane receptor (metabotropic glutamate family) 231923 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 231931 GTPase of unknown function 231942 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 231949 Ribosomal protein S6e 231959 Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production 231999 PH domain. PH stands for pleckstrin homology 232028 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 232031 7 transmembrane receptor (rhodopsin family) 232031 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 232040 ENV polyprotein (coat polyprotein) 232044 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 232044 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 232045 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232047 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 232048 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232052 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232053 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232055 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232059 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232060 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232063 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232065 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232066 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232067 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232068 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232070 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 232072 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232073 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 232074 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232077 Fork head domain 232078 Pyridoxal-phosphate dependent enzyme. Members of this family are all pyridoxal-phosphate dependent enzymes. This family includes: serine dehydratase EC:4.2.1.13 P20132, threonine dehydratase EC:4.2.1.16, tryptophan synthase beta chain EC:4.2.1.20, threoni 232094 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 232104 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 232104 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 232110 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 232110 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 232118 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 232119 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 232119 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 232143 HMG (high mobility group) box 232156 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 232157 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru 232165 Ribosomal protein S8e 232174 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 232180 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 232190 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 232201 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 232201 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 232206 Ribosomal protein L3 232210 Uncharacterized ACR, COG2135 232211 ENV polyprotein (coat polyprotein) 232227 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 232235 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 232235 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 232237 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit 232259 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 232259 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 232273 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 232288 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 232288 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 232312 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 232320 Cyclophilin type peptidyl-prolyl cis-trans isomerase 232322 Actin 232327 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 232333 Sodium:neurotransmitter symporter family 232333 Sodium:neurotransmitter symporter family 232336 Lipoxygenase 232336 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 232338 Nebulin repeat 232338 LIM domain. This family represents two copies of the LIM structural domain 232338 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 232339 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 232339 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 232339 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 232345 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 232345 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 232345 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 232345 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 232354 Ribosomal protein S5, C-terminal domain 232354 Ribosomal protein S5, N-terminal domain 232357 Cyclophilin type peptidyl-prolyl cis-trans isomerase 232358 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 232360 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 232361 Apoptosis regulator proteins, Bcl-2 family 232361 7 transmembrane receptor (metabotropic glutamate family) 232363 7 transmembrane receptor (metabotropic glutamate family) 232363 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 232364 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 232367 7 transmembrane receptor (metabotropic glutamate family) 232367 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 232368 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 232371 Trypsin 232371 CUB domain 232378 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 232391 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 232391 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 232399 Ribosomal protein L21e 232400 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 232400 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 232401 VHS domain. Domain present in VPS-27, Hrs and STAM 232401 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 232404 ENV polyprotein (coat polyprotein) 232415 Lectin C-type domain. This family includes both long and short form C-type 232421 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 232431 7 transmembrane receptor (metabotropic glutamate family) 232431 7 transmembrane receptor (metabotropic glutamate family) 232433 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 232433 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 232440 Core histone H2A/H2B/H3/H4 232441 ADP-ribosylation factor family 232441 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 232465 Cullin family 232466 Cullin family 232485 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 232485 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 232487 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 232491 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 232493 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari 232510 Ribosomal protein S6e 232515 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 232515 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 232527 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 232527 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 232534 Ribosomal protein S5, C-terminal domain 232534 Ribosomal protein S5, N-terminal domain 232539 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 232539 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 232539 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 232560 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 232569 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 232578 Polyprenyl synthetase 232581 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 232585 von Willebrand factor type D domain 232587 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 232587 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 232597 AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4- 232598 LIM domain. This family represents two copies of the LIM structural domain 232616 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 232619 Nucleoside diphosphate kinase 232626 Ribosomal protein L21e 232637 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 232647 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 232647 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 232649 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 232665 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 232670 Tetraspanin family 232680 Zinc carboxypeptidase 232680 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol 232683 Proteasome A-type and B-type 232706 Fructose-bisphosphate aldolase class-I 232706 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 232710 PMP-22/EMP/MP20/Claudin family 232714 Trefoil (P-type) domain 232714 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 232717 Trypsin 232717 Trypsin 232718 Trypsin 232724 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232727 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232728 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232733 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 232733 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 232744 7 transmembrane receptor (rhodopsin family) 232745 Polysaccharide deacetylase. This domain is found in polysaccharide deacetylase. This family of polysaccharide deacetylases includes NodB (nodulation protein B from Rhizobium) which is a chitooligosaccharide deacetylase. It also includes chitin deacetylase 232745 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 232747 7 transmembrane receptor (rhodopsin family) 232750 7 transmembrane receptor (rhodopsin family) 232752 7 transmembrane receptor (rhodopsin family) 232758 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232759 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 232762 7 transmembrane receptor (rhodopsin family) 232774 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 232776 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 232780 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 232781 Ribosomal protein L31e 232782 Ribosomal protein L35Ae 232784 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 232786 Ribosomal protein S5, N-terminal domain 232787 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 232791 Not1 N-terminal domain, CCR4-Not complex component 232791 NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5 232798 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 232804 ATP synthase subunit C 232815 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 232815 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 232822 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 232823 Actin 232825 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 232826 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 232828 7 transmembrane receptor (metabotropic glutamate family) 232828 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 232838 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 232852 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 232854 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 232860 Protein kinase domain 232860 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 232862 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 232862 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 232862 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 232867 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 232867 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 232868 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 232871 7 transmembrane receptor (metabotropic glutamate family) 232871 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 232873 7 transmembrane receptor (metabotropic glutamate family) 232873 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 232875 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 232878 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 232878 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 232879 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 232879 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 232883 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 232883 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 232886 ENV polyprotein (coat polyprotein) 232887 WD domain, G-beta repeat 232890 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 232892 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 232895 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 232899 Homeobox domain 232900 Homeobox domain 232901 Homeobox domain 232910 Clathrin adaptor complex small chain 232910 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 232920 Cyclophilin type peptidyl-prolyl cis-trans isomerase 232923 ENV polyprotein (coat polyprotein) 232923 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 232926 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 232926 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 232930 GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 232930 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 232937 7 transmembrane receptor (Secretin family) 232941 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 232944 Kinase associated domain 1 232947 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 232953 7 transmembrane receptor (rhodopsin family) 232953 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 232954 Intermediate filament protein 232958 Protein kinase domain 232959 7 transmembrane receptor (rhodopsin family) 232959 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 232961 7 transmembrane receptor (rhodopsin family) 232961 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 232962 7 transmembrane receptor (rhodopsin family) 232962 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 232963 Protein kinase domain 232974 Ets-domain 232978 Protein kinase domain 232984 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc 232993 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 232994 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 232995 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 232998 Ubiquitin carboxyl-terminal hydrolases family 2 233003 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 233005 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 233006 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 233007 ENV polyprotein (coat polyprotein) 233011 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 233011 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 233017 Phosphoglycerate mutase family 233017 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 233022 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 233022 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 233024 Protein kinase domain 233024 Octicosapeptide repeat. Short motif that may bind Ca2+ 233024 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 233033 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 233033 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 233033 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 233038 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated 233040 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated 233046 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 233053 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 233055 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 233055 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 233057 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 233061 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 233067 Fibronectin type III domain 233067 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 233067 Fibronectin type III domain 233067 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 233067 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 233069 Protein kinase domain 233071 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 233071 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 233075 Ribosomal L18ae protein family 233079 7 transmembrane receptor (rhodopsin family) 233080 7 transmembrane receptor (rhodopsin family) 233081 7 transmembrane receptor (rhodopsin family) 233086 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 233087 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 233089 Ribosomal protein L23 233098 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 233107 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 233111 CUB domain 233121 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233121 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233131 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 233147 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 233166 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 233168 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 233169 ENV polyprotein (coat polyprotein) 233169 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 233173 Fusion glycoprotein F0 233173 ENV polyprotein (coat polyprotein) 233173 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 233174 HMG (high mobility group) box 233176 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 233177 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233177 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233179 7 transmembrane receptor (metabotropic glutamate family) 233179 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233181 ENV polyprotein (coat polyprotein) 233181 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 233182 Ribosomal L18ae protein family 233183 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 233184 Spumavirus gag protein 233184 Granin (chromogranin or secretogranin) 233184 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 233184 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 233187 PMP-22/EMP/MP20/Claudin family 233187 PMP-22/EMP/MP20/Claudin family 233189 Uncharacterized protein family UPF0021 233190 Trypsin 233196 IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing 233220 7 transmembrane receptor (rhodopsin family) 233221 7 transmembrane receptor (rhodopsin family) 233222 7 transmembrane receptor (rhodopsin family) 233224 7 transmembrane receptor (rhodopsin family) 233227 7 transmembrane receptor (rhodopsin family) 233230 7 transmembrane receptor (rhodopsin family) 233230 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233231 7 transmembrane receptor (rhodopsin family) 233249 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233249 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233254 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 233254 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 233256 ENV polyprotein (coat polyprotein) 233260 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 233272 Ribosomal protein S7e 233276 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 233281 Regulator of chromosome condensation (RCC1) 233281 Anaphase-promoting complex, subunit 10 (APC10) 233281 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 233281 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 233297 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 233298 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 233298 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 233298 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 233330 Homeobox domain 233330 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 233332 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 233335 Intermediate filament protein 233336 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233336 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233352 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233352 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233355 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233355 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233401 Uracil DNA glycosylase superfamily 233405 Sec1 family 233405 Sec1 family 233406 Microtubule associated protein (MAP65/ASE1 family) 233412 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidine 233412 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins. One member is a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane. One member is a a Na+-dependent nucleoside transporter se 233426 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 233426 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 233427 Ribosomal protein S28e 233436 7 transmembrane receptor (rhodopsin family) 233437 7 transmembrane receptor (metabotropic glutamate family) 233437 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233438 7 transmembrane receptor (metabotropic glutamate family) 233439 7 transmembrane receptor (metabotropic glutamate family) 233440 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233441 7 transmembrane receptor (metabotropic glutamate family) 233443 7 transmembrane receptor (metabotropic glutamate family) 233443 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233444 7 transmembrane receptor (metabotropic glutamate family) 233445 7 transmembrane receptor (metabotropic glutamate family) 233445 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233448 7 transmembrane receptor (metabotropic glutamate family) 233448 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233452 7 transmembrane receptor (metabotropic glutamate family) 233452 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233453 7 transmembrane receptor (rhodopsin family) 233453 7 transmembrane receptor (metabotropic glutamate family) 233453 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233454 7 transmembrane receptor (rhodopsin family) 233457 7 transmembrane receptor (rhodopsin family) 233462 7 transmembrane receptor (rhodopsin family) 233465 7 transmembrane receptor (rhodopsin family) 233466 7 transmembrane receptor (metabotropic glutamate family) 233466 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233469 7 transmembrane receptor (metabotropic glutamate family) 233469 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 233471 Ribosomal family S4e 233476 Ribosomal protein S17 233479 Ribosomal protein S17 233485 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233485 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233489 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 233493 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 233493 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 233508 14-3-3 protein 233529 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 233531 Ribosomal L15 233532 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 233537 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 233537 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 233542 ENV polyprotein (coat polyprotein) 233545 ENT domain. This presumed domain is named after Emsy N Terminus (ENT). Emsy is a protein that is amplified in breast cancer and interacts with BRCA2. The N terminus of this protein is found to be similar to other vertebrate and plant proteins of unknown 233549 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 233549 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 233552 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 233564 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 233564 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 233566 Ribosomal protein S8e 233571 7 transmembrane receptor (rhodopsin family) 233576 7 transmembrane receptor (rhodopsin family) 233578 7 transmembrane receptor (rhodopsin family) 233587 7 transmembrane receptor (rhodopsin family) 233592 7 transmembrane receptor (rhodopsin family) 233595 7 transmembrane receptor (rhodopsin family) 233596 7 transmembrane receptor (rhodopsin family) 233600 7 transmembrane receptor (rhodopsin family) 233601 7 transmembrane receptor (rhodopsin family) 233609 7 transmembrane receptor (rhodopsin family) 233613 7 transmembrane receptor (rhodopsin family) 233616 7 transmembrane receptor (rhodopsin family) 233620 7 transmembrane receptor (rhodopsin family) 233634 Ribonucleotide reductase, small chain 233635 7 transmembrane receptor (rhodopsin family) 233636 7 transmembrane receptor (rhodopsin family) 233636 Ubiquitin carboxyl-terminal hydrolase family 2 233636 Ubiquitin carboxyl-terminal hydrolases family 2 233636 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 233637 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 233642 7 transmembrane receptor (rhodopsin family) 233644 7 transmembrane receptor (rhodopsin family) 233645 7 transmembrane receptor (rhodopsin family) 233646 7 transmembrane receptor (rhodopsin family) 233649 Cyclic nucleotide-binding domain 233649 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 233650 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 233651 Cadherin domain 233665 7 transmembrane receptor (rhodopsin family) 233666 7 transmembrane receptor (rhodopsin family) 233669 'chromo' (CHRromatin Organization MOdifier) domain 233669 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 233670 7 transmembrane receptor (rhodopsin family) 233681 7 transmembrane receptor (rhodopsin family) 233688 Cyclophilin type peptidyl-prolyl cis-trans isomerase 233690 7 transmembrane receptor (rhodopsin family) 233695 7 transmembrane receptor (rhodopsin family) 233695 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 233699 7 transmembrane receptor (rhodopsin family) 233708 7 transmembrane receptor (rhodopsin family) 233709 7 transmembrane receptor (rhodopsin family) 233711 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 233712 7 transmembrane receptor (rhodopsin family) 233721 Aminotransferase class-V 233726 Importin-beta N-terminal domain 233726 Importin-beta N-terminal domain 233733 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 233733 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 233735 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 233744 Reeler domain 233744 Thrombospondin type 1 domain 233747 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 233781 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 233781 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 233789 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 233789 Bunyavirus glycoprotein G1. Bunyavirus has three genomic segments: small (S), middle-sized (M), and large (L). The S segment encodes the nucleocapsid and a non-structural protein. The M segment codes for two glycoproteins, G1 and G2, and another non-struc 233801 AMP-binding enzyme 233802 pfam02926, THUMP, THUMP domain. The THUMP domain is named after after thiouridine synthases, methylases and PSUSs. The THUMP domain consists of about 110 amino acid residues. It is predicted that this domain is an RNA-binding domain that adopts an alpha/ 233805 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 233808 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 233810 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 233813 von Willebrand factor type A domain 233815 Cyclophilin type peptidyl-prolyl cis-trans isomerase 233830 Tubulin binding cofactor A 233832 Zinc finger, C3HC4 type (RING finger) 233832 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 233836 Sodium:solute symporter family 233858 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 233858 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 233859 Glutamate/Leucine/Phenylalanine/Valine dehydrogenase 233861 Ribosomal protein S7e 233877 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 233878 Sushi domain (SCR repeat) 233885 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 233888 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 233890 Vinculin family 233892 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 233902 F-box domain 233902 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 233904 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom 233904 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 233908 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 233910 B-box zinc finger 233910 Zinc finger, C3HC4 type (RING finger) 233910 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 233919 7 transmembrane receptor (rhodopsin family) 233921 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 233926 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233926 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233938 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 233952 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 233952 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 233960 Ribosomal protein L21e 233977 Uncharacterized ACR, COG1579 233977 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 233977 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length 233977 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 233979 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 233987 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 233991 Ribosomal L22e protein family 233992 Ribosomal protein L21e 233995 Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate 234004 Eukaryotic porin 234072 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi 234073 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 234097 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 234097 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 234100 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 234107 CUB domain 234120 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 234129 HMG14 and HMG17 234132 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 234135 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 234158 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234158 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234187 Eukaryotic porin 234188 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 234188 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 234190 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 234190 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 234201 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234201 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234209 Cyclophilin type peptidyl-prolyl cis-trans isomerase 234214 Sorbin homologous domain 234214 Sorbin homologous domain 234214 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 234217 Actin 234218 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 234219 Helix-loop-helix DNA-binding domain 234219 Helix-loop-helix DNA-binding domain 234247 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234247 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234249 ENV polyprotein (coat polyprotein) 234254 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 234267 Myelin proteolipid protein (PLP or lipophilin) 234286 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234292 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 234309 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 234313 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 234319 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234319 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234321 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 234329 B-box zinc finger 234333 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 234353 PH domain. PH stands for pleckstrin homology 234358 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 234362 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 234365 YjeF-related protein N-terminus 234373 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 234373 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 234374 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 234374 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 234378 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 234378 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 234378 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 234378 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 234384 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 234400 14-3-3 protein 234404 SCO1/SenC. This family is involved in biogenesis of respiratory and photosynthetic systems. SCO1 is required for a post-translational step in the accumulation of subunits COXI and COXII of cytochrome c oxidase. SenC is required for optimal cytochrome c o 234407 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va 234415 7 transmembrane receptor (rhodopsin family) 234417 7 transmembrane receptor (rhodopsin family) 234419 Tropomyosin 234419 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 234425 Intermediate filament protein 234427 Sushi domain (SCR repeat) 234429 Protein of unknown function, DUF259 234438 Phosphoribulokinase / Uridine kinase family 234440 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234440 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234441 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 234445 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 234448 Ribosomal protein L13e 234456 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 234460 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 234463 Domain of unknown function 234474 Ribosomal protein L21e 234483 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 234486 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a 234501 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234501 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234505 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234505 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234520 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234520 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234542 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc 234546 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 234550 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234563 Carboxylesterase 234564 Carboxylesterase 234576 Clathrin adaptor complex small chain 234576 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 234577 C2 domain 234593 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t 234594 CCR4-Not complex component, Not1. The Ccr4-Not complex is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID 234595 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 234611 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 234612 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 234612 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 234622 DNA directed RNA polymerase, 7 kDa subunit 234628 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 234628 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 234628 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 234628 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 234635 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 234640 Ribosomal protein L10 234663 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 234664 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 234667 Carboxylesterase 234668 Carboxylesterase 234669 Carboxylesterase 234671 Carboxylesterase 234672 Carboxylesterase 234673 Carboxylesterase 234675 Carboxylesterase 234676 Carboxylesterase 234677 Carboxylesterase 234678 Uncharacterised protein family (UPF0183). This family of proteins includes Lin-10 from C. elegans 234684 F-box domain 234690 Ribosomal protein S5, C-terminal domain 234690 Ribosomal protein S5, N-terminal domain 234699 WD domain, G-beta repeat 234702 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 234703 Ribosomal L10 234703 Ribosomal protein L16 234723 Mitosis protein DIM1 234723 Mitosis protein DIM1 234723 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 234725 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 234728 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl 234751 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 234779 SH2 domain 234779 PH domain. PH stands for pleckstrin homology 234779 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 234793 Adenosine-deaminase (editase) domain 234796 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 234796 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 234826 ENV polyprotein (coat polyprotein) 234848 Ribosomal protein L13e 234852 Eukaryotic protein of unknown function, DUF279 234854 Protein kinase domain 234854 Uncharacterized protein family (ORF7) DUF. Several members of this family are Borrelia burgdorferi plasmid proteins of uncharacterized function 234858 ENV polyprotein (coat polyprotein) 234858 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 234858 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 234858 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 234865 Nup133 nucleoporin. RNA undergoing nuclear export first encounters the basket of the nuclear pore. Nup133 is a nucleoporin accessible on the basket side of the pore 234877 Transcription factor TFIIB repeat 234878 Protein kinase domain 234882 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 234885 7 transmembrane receptor (rhodopsin family) 234886 ENV polyprotein (coat polyprotein) 234886 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 234886 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 234886 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 234886 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 234886 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 234886 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 234889 Adenylate and Guanylate cyclase catalytic domain 234892 ENV polyprotein (coat polyprotein) 234893 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 234897 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 234902 Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II 234902 pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II 234921 ENV polyprotein (coat polyprotein) 234922 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 234928 ENV polyprotein (coat polyprotein) 234929 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 234929 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 234929 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 234930 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 234930 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 234930 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 234932 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 234939 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 234955 Macrophage migration inhibitory factor (MIF) 234961 ENV polyprotein (coat polyprotein) 234967 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 234973 Cadherin domain 234981 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 234987 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 234997 7 transmembrane receptor (rhodopsin family) 235005 7 transmembrane receptor (rhodopsin family) 235019 7 transmembrane receptor (rhodopsin family) 235019 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 235019 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 235021 7 transmembrane receptor (rhodopsin family) 235025 7 transmembrane receptor (rhodopsin family) 235026 7 transmembrane receptor (rhodopsin family) 235033 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 235036 Second step splicing factor 1. SSF is thought to bind rRNA and similarity with peter pan in Drosophila suggests a role in cell proliferation 235040 Peptidase family C54 235044 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 235048 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 235072 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 235075 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 235075 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 235078 ENV polyprotein (coat polyprotein) 235086 Fibronectin type III domain 235086 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235087 Protein kinase domain 235104 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235106 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235130 Thrombospondin type 1 domain 235130 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 235132 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 235145 Triosephosphate isomerase 235146 Ribosomal protein L21e 235175 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 235186 7 transmembrane receptor (rhodopsin family) 235195 7 transmembrane receptor (rhodopsin family) 235196 7 transmembrane receptor (rhodopsin family) 235206 7 transmembrane receptor (rhodopsin family) 235207 7 transmembrane receptor (rhodopsin family) 235221 7 transmembrane receptor (rhodopsin family) 235226 7 transmembrane receptor (rhodopsin family) 235231 7 transmembrane receptor (rhodopsin family) 235234 7 transmembrane receptor (rhodopsin family) 235235 Actin 235236 7 transmembrane receptor (rhodopsin family) 235237 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 235237 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 235237 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 235241 7 transmembrane receptor (rhodopsin family) 235243 7 transmembrane receptor (rhodopsin family) 235246 7 transmembrane receptor (rhodopsin family) 235247 7 transmembrane receptor (rhodopsin family) 235248 7 transmembrane receptor (rhodopsin family) 235254 7 transmembrane receptor (rhodopsin family) 235255 7 transmembrane receptor (rhodopsin family) 235256 7 transmembrane receptor (rhodopsin family) 235257 7 transmembrane receptor (rhodopsin family) 235260 7 transmembrane receptor (rhodopsin family) 235266 7 transmembrane receptor (rhodopsin family) 235277 7 transmembrane receptor (rhodopsin family) 235280 7 transmembrane receptor (rhodopsin family) 235283 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 235293 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. 235307 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 235320 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 235320 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 235323 Ubiquitin carboxyl-terminal hydrolase family 2 235323 Ubiquitin carboxyl-terminal hydrolase family 2 235323 Ubiquitin carboxyl-terminal hydrolases family 2 235336 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 235336 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 235339 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 235341 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 235344 Protein kinase domain 235353 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 235353 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 235364 NAC domain 235379 Olfactomedin-like domain 235379 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 235379 Olfactomedin-like domain 235379 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 235384 ADP-ribosylation factor family 235384 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 235384 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 235385 Aconitase family (aconitate hydratase) 235385 Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism 235386 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 235392 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 235392 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 235394 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 235406 PX domain. PX domains bind to phosphoinositides 235406 PX domain. PX domains bind to phosphoinositides 235406 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 235412 Ribosomal protein S8e 235424 Ribosomal protein L5 235424 ribosomal L5P family C-terminus. This region is found associated with pfam00281 235427 GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 235427 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 235435 Glycosyl hydrolase family 1 235440 WD domain, G-beta repeat 235440 Regulator of chromosome condensation (RCC1) 235440 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 235442 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 235444 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 235444 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 235459 Transcription initiation factor IIA, gamma subunit, beta-barrel domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multi 235459 Transcription initiation factor IIA, gamma subunit, helical domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multimeri 235472 Fibronectin type III domain 235472 Fibronectin type III domain 235472 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235474 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 235480 C2 domain 235486 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 235497 Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined 235497 Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription 235501 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 235505 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 235505 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 235506 Cyclophilin type peptidyl-prolyl cis-trans isomerase 235509 Fructose-bisphosphate aldolase class-I 235513 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 235517 HMG (high mobility group) box 235523 Ribosomal protein L6 235523 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 235526 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved 235527 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve 235527 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved 235533 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 235533 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 235534 Histidine acid phosphatase 235554 ADP-ribosylation factor family 235554 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 235555 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 235559 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 235559 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 235567 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 235574 E1-E2 ATPase 235574 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 235574 Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 235574 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 235575 Ribosomal L10 235575 Ribosomal protein L16 235580 von Willebrand factor type A domain 235581 Ribosomal protein L6e 235584 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 235587 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri 235587 pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini 235611 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 235611 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 235612 F-box domain 235623 WD domain, G-beta repeat 235623 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 235624 Protein-tyrosine phosphatase 235626 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 235633 Vacuolar sorting protein 9 (VPS9) domain 235636 Microtubule associated protein (MAP65/ASE1 family) 235641 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 235663 Ribosomal protein S5, N-terminal domain 235663 Ribosomal protein S5, C-terminal domain 235680 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 235693 7 transmembrane receptor (rhodopsin family) 235712 7 transmembrane receptor (rhodopsin family) 235714 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 235718 Protein kinase domain 235741 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 235748 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 235748 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 235758 GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 235758 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 235759 6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each 235762 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235790 Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family 235795 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 235796 Defensin propeptide 235800 Interferon alpha/beta domain 235805 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235806 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 235812 Phosphoglycerate mutase family 235812 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 235820 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 235841 Homeobox domain 235847 7 transmembrane receptor (rhodopsin family) 235852 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 235854 7 transmembrane receptor (rhodopsin family) 235856 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 235869 7 transmembrane receptor (rhodopsin family) 235870 7 transmembrane receptor (rhodopsin family) 235872 7 transmembrane receptor (metabotropic glutamate family) 235876 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 235881 Protein kinase domain 235882 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 235887 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 235890 Sugar (and other) transporter 235891 Sulfotransferase protein 235896 Ribosomal protein S19e 235896 Domain of unknown function DUF128. This archaebacterial protein family has no known function. The domain is found duplicated in some members 235897 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 235900 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 235904 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 235904 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 235904 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 235904 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 235908 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235909 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 235910 7 transmembrane receptor (metabotropic glutamate family) 235910 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 235916 7 transmembrane receptor (metabotropic glutamate family) 235917 Ribosomal protein L3 235923 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 235925 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 235927 7 transmembrane receptor (metabotropic glutamate family) 235927 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 235928 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 235932 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 235932 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 235934 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235938 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 235949 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 235950 7 transmembrane receptor (rhodopsin family) 235950 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 235952 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 235953 14-3-3 protein 235956 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 235962 7 transmembrane receptor (rhodopsin family) 235964 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 235966 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 235976 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 235979 Ribosomal protein S5, C-terminal domain 235979 Ribosomal protein S5, N-terminal domain 235981 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 235982 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 235984 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 235985 HMG (high mobility group) box 235992 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 235992 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 235994 7 transmembrane receptor (metabotropic glutamate family) 235997 7 transmembrane receptor (metabotropic glutamate family) 236010 Uncharacterised protein family (UPF0083) 236011 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 236011 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 236024 7 transmembrane receptor (rhodopsin family) 236025 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236032 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 236033 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 236035 Polyprenyl synthetase 236039 Ribosomal protein S7e 236047 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236060 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 236066 Ribosomal protein L10 236068 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 236077 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 236079 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236082 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 236111 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 236117 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236127 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 236133 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236137 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 236145 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 236149 Sugar (and other) transporter 236150 Protein kinase domain 236150 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 236151 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 236152 ENV polyprotein (coat polyprotein) 236160 Ribosomal protein S7e 236164 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 236182 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 236193 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 236196 Sulfotransferase protein 236207 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236210 Somatotropin hormone family 236212 7 transmembrane receptor (metabotropic glutamate family) 236214 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 236217 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 236223 Proteasome A-type and B-type 236225 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 236226 Protein kinase domain 236227 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 236228 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 236232 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 236232 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236238 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 236251 Palmitoyl protein thioesterase 236271 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 236271 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 236272 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 236286 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 236293 Sugar (and other) transporter 236294 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 236300 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 236300 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236302 7 transmembrane receptor (rhodopsin family) 236302 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 236304 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 236304 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236307 HMG (high mobility group) box 236312 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 236312 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 236317 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 236321 ENV polyprotein (coat polyprotein) 236325 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 236334 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 236334 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 236342 Protein kinase domain 236347 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 236347 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 236359 Eukaryotic initiation factor 1A 236364 ENV polyprotein (coat polyprotein) 236364 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 236364 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 236364 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 236369 Ribosomal protein L21e 236370 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 236372 7 transmembrane receptor (metabotropic glutamate family) 236393 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 236394 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236395 Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family 236401 7 transmembrane receptor (metabotropic glutamate family) 236407 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236413 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 236415 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236416 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236420 Disintegrin 236420 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 236420 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 236427 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 236428 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 236428 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 236437 7 transmembrane receptor (rhodopsin family) 236437 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 236452 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 236452 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 236456 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236463 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 236469 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 236471 Somatotropin hormone family 236473 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 236476 Cadherin domain 236479 Tropomyosin 236490 7 transmembrane receptor (rhodopsin family) 236509 7 transmembrane receptor (rhodopsin family) 236511 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 236511 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 236517 ENV polyprotein (coat polyprotein) 236518 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 236520 Uncharacterized ACR, COG1579 236524 Ribosomal protein L6e 236526 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 236533 Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown 236534 Ribosomal protein L21e 236535 7 transmembrane receptor (rhodopsin family) 236535 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 236552 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 236554 Ribosomal protein L21e 236557 Zinc carboxypeptidase 236565 Ribosomal protein L3 236573 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 236573 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 236573 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 236573 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 236574 Protein kinase domain 236579 Ribosomal protein S7e 236583 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 236605 Ribosomal protein S7e 236612 Ribosomal protein L3 236622 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 236627 Proteasome A-type and B-type 236629 Actin 236632 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 236640 ENV polyprotein (coat polyprotein) 236643 C2 domain 236643 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 236643 C2 domain 236644 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 236649 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 236650 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 236651 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 236657 Cyclophilin type peptidyl-prolyl cis-trans isomerase 236663 Core histone H2A/H2B/H3/H4 236663 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 236676 Cyclophilin type peptidyl-prolyl cis-trans isomerase 236682 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 236683 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236690 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 236698 Ribosomal protein L6e 236702 Skp1 family, tetramerisation domain 236704 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 236704 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 236720 Zinc-binding dehydrogenase 236721 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 236725 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 236727 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 236727 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 236729 Core histone H2A/H2B/H3/H4 236730 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 236737 Sodium / potassium ATPase beta chain 236742 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 236742 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236747 Zn-finger in Ran binding protein and others 236747 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 236748 Cyclophilin type peptidyl-prolyl cis-trans isomerase 236749 LIM domain. This family represents two copies of the LIM structural domain 236750 ENV polyprotein (coat polyprotein) 236762 Homeobox domain 236765 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 236766 Ribosomal protein S2 236767 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 236770 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 236774 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 236775 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 236775 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236778 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 236781 7 transmembrane receptor (rhodopsin family) 236781 7 transmembrane receptor (rhodopsin family) 236784 7 transmembrane receptor (rhodopsin family) 236785 7 transmembrane receptor (rhodopsin family) 236786 Ribosomal protein L36e 236787 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 236787 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 236794 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 236798 7 transmembrane receptor (Secretin family) 236798 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 236799 7 transmembrane receptor (Secretin family) 236805 Fructose-bisphosphate aldolase class-I 236807 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 236820 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 236823 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 236831 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 236842 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 236843 Ribosomal protein L21e 236844 Ribosomal L22e protein family 236845 RanBP1 domain 236846 Eukaryotic ribosomal protein L18 236852 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 236858 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 236863 Ribosomal protein S5, C-terminal domain 236863 Ribosomal protein S5, N-terminal domain 236864 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 236866 7 transmembrane receptor (rhodopsin family) 236868 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 236869 Ribosomal protein S19 236871 HMG (high mobility group) box 236872 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 236874 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 236875 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 236876 ENV polyprotein (coat polyprotein) 236876 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 236876 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 236876 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 236876 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 236876 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 236882 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 236882 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 236882 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 236882 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 236882 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 236882 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 236884 Ribosomal L10 236887 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 236887 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236891 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 236892 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 236893 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 236899 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase 236900 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 236904 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 236912 Ribosomal protein L13e 236913 Proteasome A-type and B-type 236913 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 236914 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 236915 PH domain. PH stands for pleckstrin homology 236915 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 236915 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 236918 Hsp90 protein 236920 START domain 236920 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 236922 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 236928 Translationally controlled tumor protein 236929 HMG (high mobility group) box 236932 Ribosomal protein S6e 236961 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 236968 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 236969 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 236973 Zinc finger, C3HC4 type (RING finger) 236973 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 236973 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 236975 NAD-dependent glycerol-3-phosphate dehydrogenase 236978 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 236978 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 236983 S-adenosyl-L-homocysteine hydrolase 236986 Fructose-bisphosphate aldolase class-I 236991 Hsp90 protein 236994 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 236994 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 237004 HMG (high mobility group) box 237005 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 237008 Phosphorylase family 2 237009 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 237010 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 237012 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 237012 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 237012 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 237014 metallopeptidase family M24 237016 Pyruvate kinase, alpha/beta domain 237016 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 237016 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 237018 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 237024 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 237025 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 237025 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 237025 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 237029 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 237029 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 237030 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 237032 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 237035 Fructose-bisphosphate aldolase class-I 237036 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 237038 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 237050 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 237068 Ribosomal family S4e 237081 WD domain, G-beta repeat 237083 Ribosomal protein L14p/L23e 237085 Ribosomal protein L10 237093 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids, some interacts with c-Myc and repress its transcriptional activity. Suggesting that other members of this family may also regulate transcription by intera 237093 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b 237096 Actin 237105 Ribosomal protein L13e 237105 Sodium:neurotransmitter symporter family 237107 GTPase of unknown function 237107 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 237108 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 237115 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 237116 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 237119 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 237119 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 237124 Protein kinase domain 237130 Fork head domain 237132 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 237132 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 237140 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 237155 Fructose-bisphosphate aldolase class-I 237156 Ribosomal protein L6 237159 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domai 237162 Ribosomal protein S2 237172 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 237175 7 transmembrane receptor (Secretin family) 237175 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 237175 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 237178 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 237178 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 237188 Protein kinase domain 237200 Helix-loop-helix DNA-binding domain 237201 Ribosomal L29e protein family 237212 Macrophage migration inhibitory factor (MIF) 237213 Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 237213 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 237213 Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 237213 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 237228 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 237241 Core histone H2A/H2B/H3/H4 237250 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 237252 B-box zinc finger 237275 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 237275 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 237284 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 237285 Ribosomal protein L23 237285 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 237295 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 237296 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 237300 Protein kinase domain 237303 Triosephosphate isomerase 237305 Enolase, N-terminal domain 237305 Enolase, C-terminal TIM barrel domain 237310 Tissue factor 237310 Tissue factor 237313 Tissue factor 237319 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 237319 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 237320 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 237320 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 237321 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 237323 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 237331 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 237331 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 237335 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 237335 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 237335 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 237342 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 237343 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 237343 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 237360 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 237360 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 237361 Ribosomal protein L15 237362 7 transmembrane receptor (rhodopsin family) 237369 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 237369 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 237369 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 237374 Sodium:solute symporter family 237374 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 237391 Protein-tyrosine phosphatase 237391 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 237396 ENV polyprotein (coat polyprotein) 237396 7 transmembrane receptor (metabotropic glutamate family) 237396 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 237396 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 237396 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 237396 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 237396 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 237397 C2 domain 237403 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 237408 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 237409 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 237410 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 237410 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 237412 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 237415 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 237415 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 237415 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 237415 IMP dehydrogenase / GMP reductase C terminus. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is 237427 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 237432 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 237433 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 237436 Growth-Arrest-Specific Protein 2 Domain 237436 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 237438 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 237438 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 237448 Translation initiation factor SUI1 237459 Protein kinase domain 237482 Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast member of this family 237482 pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem 237483 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 237483 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 237499 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 237500 TPR Domain 237501 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 237512 Ribosomal protein S8 237518 AMP-binding enzyme 237523 L1 transposable element 237523 Fibronectin type III domain 237523 Protein-tyrosine phosphatase 237523 von Willebrand factor type D domain 237523 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 237536 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 237542 Oxysterol-binding protein 237542 PH domain. PH stands for pleckstrin homology 237546 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 237547 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 237549 SCP-like extracellular protein 237549 SCP-like extracellular protein. This domain is also found in prokaryotes 237553 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 237564 Kinesin motor domain 237564 Intermediate filament protein 237564 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 237577 Ribosomal protein S26e 237581 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 237604 Ribosomal protein S19e 237611 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 237611 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 237611 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 237612 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 237626 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 237630 PH domain. PH stands for pleckstrin homology 237636 AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance 237636 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 237666 Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold 237666 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 237668 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 237696 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 237699 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 237703 Orotidine 5'-phosphate decarboxylase / HUMPS family. This family includes Orotidine 5'-phosphate decarboxylase enzymes EC:4.1.1.23 that are involved in the final step of pyrimidine biosynthesis. The family also includes enzymes such as hexulose-6-phosphat 237704 ENV polyprotein (coat polyprotein) 237704 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 237704 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 237704 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 237711 WD domain, G-beta repeat 237720 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 237730 Fibrillarin 237751 GTPase of unknown function 237751 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 237755 7 transmembrane receptor (rhodopsin family) 237757 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 237761 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 237761 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 237765 Ribosomal protein S6e 237767 7 transmembrane receptor (rhodopsin family) 237769 7 transmembrane receptor (rhodopsin family) 237769 C. elegans Sre G protein-coupled chemoreceptor. Caenorhabditis elegans Sre proteins are candidate chemosensory receptors. There are four main recognized groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre (this family), Sra pfam02117 and Srb pfam0217 237769 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 237769 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 237771 B-box zinc finger 237771 Zinc finger, C3HC4 type (RING finger) 237771 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 237775 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 237780 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 237780 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 237806 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 237818 PMP-22/EMP/MP20/Claudin family 237823 AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt 237828 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 237831 Sodium:sulfate symporter transmembrane region. Some members in this family belong to the subfamily SODIT1 237831 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 237837 7 transmembrane receptor (rhodopsin family) 237837 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 237837 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 237880 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 237889 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 237891 Growth-Arrest-Specific Protein 2 Domain 237891 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 237898 Ubiquitin carboxyl-terminal hydrolase family 2 237909 Ribosomal protein L21e 237926 Oxygen-independent Coproporphyrinogen III oxidase. This family of bacterial enzymes catalyses the anaerobic transformation of coproporphyrinogen III to protoporphyrinogen IX required for porphyrin biosynthesis 237930 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 237934 Intermediate filament protein 237934 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 237940 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 237940 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 237940 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge 237958 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 237958 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 237960 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 237987 Protein of unknown function, DUF270 237989 B-box zinc finger 237989 Zinc finger, C3HC4 type (RING finger) 237989 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 238011 Sulfatase 238011 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 238023 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold 238024 Fructosamine kinase. This family includes eukaryotic fructosamine-3-kinase enzymes. The family also includes bacterial members that have not been characterised but probably have a similar or identical function 238024 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 238029 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 238057 Transforming growth factor beta like domain 238057 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 238074 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 238076 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 238076 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 238091 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 238091 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 238094 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 238096 Ribosomal protein L13e 238096 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 238096 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 238099 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 238119 Pyruvate kinase, alpha/beta domain 238119 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 238119 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 238143 HMG (high mobility group) box 238153 Ribosomal protein S17 238162 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 238174 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 238176 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe 238198 Ribosomal protein L21e 238199 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 238205 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 238217 Ribosomal L10 238217 Ribosomal protein L16 238220 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238251 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 238252 7 transmembrane receptor (rhodopsin family) 238252 7 transmembrane receptor (rhodopsin family) 238257 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in resp 238257 Eukaryotic protein of unknown function, DUF284. Members of this family have no known function. They have been predicted to contain transmembrane helices 238257 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo 238266 C2 domain 238271 Cyclic nucleotide-binding domain 238271 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 238271 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 238276 Protein kinase A anchor 238294 Galactoside-binding lectin 238294 Ribosomal L18ae protein family 238320 Transposase. Transposase proteins are necessary for efficient DNA transposition. This family consists of various E. coli insertion elements and other bacterial transposases some of which are members of the IS3 family 238323 MIT domain 238324 Ribosomal protein S12 238329 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 238331 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 238333 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 238335 Eukaryotic initiation factor 1A 238338 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 238350 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 238351 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 238351 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 238351 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 238353 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps 238354 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 238377 7 transmembrane receptor (rhodopsin family) 238379 Kinesin motor domain 238379 Uncharacterized ACR, COG1579 238379 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 238379 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 238384 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 238386 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 238393 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 238394 GTP1/OBG family 238394 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 238394 TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae 238395 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 238400 S25 ribosomal protein 238406 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 238406 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 238409 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238410 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238412 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238413 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238414 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238418 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238420 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238424 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238425 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238427 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238433 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238437 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238440 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238443 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238444 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238447 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238448 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 238462 ENV polyprotein (coat polyprotein) 238463 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 238463 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 238465 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 238467 Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses 238489 AMP-binding enzyme 238505 Vitamin B12 dependent methionine synthase, activation domain 238505 B12 binding domain. This domain binds to B12 (adenosylcobamide), it is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase 238505 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 238505 B12 binding domain. This B12 binding domain is found in methionine synthase EC:2.1.1.13, and other shorter proteins that bind to B12. This domain is always found to the N-terminus of pfam02310. The structure of this domain is known, it is a 4 helix bundle 238505 Pterin binding enzyme. This family includes a variety of pterin binding enzymes that all adopt a TIM barrel fold. The family includes dihydropteroate synthase EC:2.5.1.15 as well as a group methyltransferase enzymes including methyltetrahydrofolate, corri 238507 Ribonucleotide reductase, small chain 238533 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 238533 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 238547 Core histone H2A/H2B/H3/H4 238550 Core histone H2A/H2B/H3/H4 238550 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 238555 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 238564 Protein kinase domain 238568 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 238569 7 transmembrane receptor (metabotropic glutamate family) 238569 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 238569 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 238569 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 238569 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 238569 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 238572 Sugar (and other) transporter 238586 Chaperonin 10 Kd subunit 238599 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 238619 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 238619 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 238623 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 238637 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 238637 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 238639 Cyclophilin type peptidyl-prolyl cis-trans isomerase 238643 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 238656 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 238657 Actin 238661 Ribosomal protein L13e 238672 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 238678 ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems 238679 Ribosomal protein L23 238680 Fibrinogen beta and gamma chains, C-terminal globular domain 238680 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 238681 Translation initiation factor SUI1 238688 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 238688 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 238690 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 238692 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 238693 tRNA synthetases class I (K). This family includes only lysyl tRNA synthetases from prokaryotes 238703 NADH dehydrogenase 238710 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 238722 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 238724 ENV polyprotein (coat polyprotein) 238725 7 transmembrane receptor (rhodopsin family) 238725 7 transmembrane receptor (rhodopsin family) 238756 Ribosomal protein S5, C-terminal domain 238756 Ribosomal protein S5, N-terminal domain 238761 HMG (high mobility group) box 238765 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 238766 Ribosomal protein L21e 238786 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 238787 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 238787 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 238787 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 238790 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 238790 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 238804 Ribosomal L15 238836 Core histone H2A/H2B/H3/H4 238871 3'5'-cyclic nucleotide phosphodiesterase 238871 3'5'-cyclic nucleotide phosphodiesterase 238880 Actin 238880 Actin 238881 Electron transfer flavoprotein alpha subunit. This protein is distantly related to and forms a heterodimer with pfam01012 238896 WD domain, G-beta repeat 238927 Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair 238939 Cadherin domain 238943 Enolase, N-terminal domain 238943 Enolase, C-terminal TIM barrel domain 238952 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 238955 Homeobox domain 238955 Pou domain - N-terminal to homeobox domain 238963 MIT domain 238963 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 238974 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 239006 Ribosomal protein S7e 239006 Orbivirus outer capsid protein VP2. VP2 acts as an anchor for VP1 and VP3. VP2 contains a non-specific DNA and RNA binding domain in the N-terminus 239013 Core histone H2A/H2B/H3/H4 239017 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 239017 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 239037 Fibronectin type III domain 239038 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 239052 ENV polyprotein (coat polyprotein) 239053 ENV polyprotein (coat polyprotein) 239054 Protein-tyrosine phosphatase 239054 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 239055 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 239055 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 239076 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone 239080 7 transmembrane receptor (rhodopsin family) 239081 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 239085 Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA re 239087 Transcription initiation factor TFIID 23-30kDa subunit 239089 7 transmembrane receptor (metabotropic glutamate family) 239090 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 239091 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 239097 Leishmanolysin 239099 Homeobox domain 239106 Trypsin 239107 Fructose-bisphosphate aldolase class-I 239118 Ribosomal protein L13e 239121 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 239121 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 239122 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 239125 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 239126 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 239126 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 239138 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 239151 GTPase of unknown function 239157 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 239157 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 239158 'Cold-shock' DNA-binding domain 239167 ENV polyprotein (coat polyprotein) 239177 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 239177 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 239182 Ribosomal L29e protein family 239186 FKBP-type peptidyl-prolyl cis-trans isomerase 239189 PH domain. PH stands for pleckstrin homology 239191 C-5 cytosine-specific DNA methylase 239192 Olfactomedin-like domain 239193 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 239199 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 239217 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 239217 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 239245 Ribosomal protein L15 239250 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 239250 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 239273 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 239273 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 239283 7 transmembrane receptor (rhodopsin family) 239294 Ribosomal protein L19e 239314 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length 239318 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 239318 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 239318 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 239319 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 239324 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 239324 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 239336 7 transmembrane receptor (rhodopsin family) 239336 7 transmembrane receptor (rhodopsin family) 239337 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 239337 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 239337 Ribosomal protein L23 239337 Thrombospondin type 1 domain 239337 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 239337 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 239341 Elongation factor 1 gamma, conserved domain 239343 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 239343 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 239343 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 239351 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 239364 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 239365 ENV polyprotein (coat polyprotein) 239366 ENV polyprotein (coat polyprotein) 239369 ENV polyprotein (coat polyprotein) 239378 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 239383 Ribonucleotide reductase, small chain 239390 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 239390 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 239392 Core histone H2A/H2B/H3/H4 239393 CUB domain 239406 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 239406 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 239411 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 239418 Ribosomal L18ae protein family 239420 CUB domain 239420 Sushi domain (SCR repeat) 239422 CUB domain 239422 Sushi domain (SCR repeat) 239424 CUB domain 239428 ENV polyprotein (coat polyprotein) 239428 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 239428 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 239428 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 239428 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 239434 Ribosomal L15 239454 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 239472 Ribosomal protein L24e 239488 ENV polyprotein (coat polyprotein) 239499 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 239502 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 239502 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 239510 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 239515 Glycosyl hydrolase family 14. This family are beta amylases 239525 DHHA2 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA2 for DHH associated domain. This domain is diagnostic of DHH subfamily 2 members. The domain is about 120 residues long and contains a conserved DXK motif at 239526 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 239528 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 239530 7 transmembrane receptor (rhodopsin family) 239536 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 239537 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 239540 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 239554 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 239554 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 239555 Mab-21 protein 239555 Mab-21 protein 239556 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 239564 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 239564 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 239591 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 239593 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 239598 S25 ribosomal protein 239598 Poly-adenylate binding protein, unique domain 239603 Sugar (and other) transporter 239606 Sugar (and other) transporter 239609 von Willebrand factor type D domain 239609 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 239611 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 239615 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 239618 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 239634 ENV polyprotein (coat polyprotein) 239651 7 transmembrane receptor (rhodopsin family) 239652 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 239652 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 239658 Ribosomal protein L36e 239659 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 239667 AMP-binding enzyme 239673 Intermediate filament protein 239673 Intermediate filament protein 239673 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 239673 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 239674 Intermediate filament protein 239679 Homeobox domain 239685 ENV polyprotein (coat polyprotein) 239685 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 239685 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 239685 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 239690 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 239707 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 239708 PWI domain 239710 ENV polyprotein (coat polyprotein) 239714 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 239719 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 239724 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 239724 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 239728 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 239731 Fibronectin type III domain 239731 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 239739 Lysosome-associated membrane glycoprotein (Lamp) 239740 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 239740 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 239743 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 239743 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 239748 Elongation factor 1 gamma, conserved domain 239748 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 239749 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 239755 Ribosomal protein L11, RNA binding domain 239755 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 239759 Lipase 239759 Lipase 239761 Ribosomal protein L35Ae 239761 Core histone H2A/H2B/H3/H4 239791 Eukaryotic porin 239796 Mab-21 protein 239796 Mab-21 protein 239814 S25 ribosomal protein 239826 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 239827 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 239828 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1 239833 Leishmanolysin 239835 Protein kinase domain 239835 Fibronectin type III domain 239845 7 transmembrane receptor (metabotropic glutamate family) 239845 7 transmembrane receptor (metabotropic glutamate family) 239846 Cyclophilin type peptidyl-prolyl cis-trans isomerase 239852 Zona pellucida-like domain 239853 7 transmembrane receptor (Secretin family) 239853 7 transmembrane receptor (Secretin family) 239853 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 239855 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 239857 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 239857 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 239863 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 239878 Glycyl-tRNA synthetase beta subunit 239878 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 239893 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 239916 Phosphoglycerate kinase 239931 PMP-22/EMP/MP20/Claudin family 239931 PMP-22/EMP/MP20/Claudin family 239933 Ribosomal protein L13e 239940 Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int 239950 Galactose binding lectin domain 239955 Ribosomal protein S6e 239995 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 239998 Clathrin adaptor complex small chain 240003 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 240003 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 240007 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 240009 Ribosomal protein L19e 240027 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240028 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act 240029 7 transmembrane receptor (metabotropic glutamate family) 240029 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 240030 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 240031 Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p4 240034 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240038 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240039 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240040 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240041 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240042 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 240045 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 240047 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 240047 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 240047 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 240053 7 transmembrane receptor (rhodopsin family) 240054 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in at least one member, proteolysis has shown that the Peptid 240055 Ribosomal protein S26e 240057 C2 domain 240057 Tropomyosin 240057 PH domain. PH stands for pleckstrin homology 240057 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 240057 ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a 240058 C2 domain 240061 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 240063 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240064 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240064 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240067 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240068 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240069 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 240072 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 240073 Class II histocompatibility antigen, alpha domain 240074 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 240074 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 240075 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 240075 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 240087 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 240089 Core histone H2A/H2B/H3/H4 240091 Protein kinase domain 240091 Class I Histocompatibility antigen, domains alpha 1 and 2 240093 Zinc finger, C3HC4 type (RING finger) 240094 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 240095 Class I Histocompatibility antigen, domains alpha 1 and 2 240095 Class I Histocompatibility antigen, domains alpha 1 and 2 240097 7 transmembrane receptor (rhodopsin family) 240100 7 transmembrane receptor (rhodopsin family) 240110 Ribosomal L29e protein family 240117 Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play 240119 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 240119 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo 240120 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240121 Fibronectin type III domain 240121 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 240123 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 240124 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 240125 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 240155 Ribosomal protein L11, RNA binding domain 240156 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 240160 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero 240168 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 240168 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 240171 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 240172 Ribosomal L29e protein family 240182 Cullin family 240182 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 240183 Zinc finger, C3HC4 type (RING finger) 240188 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 240188 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 240195 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 240198 Proteasome A-type and B-type 240204 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 240204 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 240205 Triosephosphate isomerase 240215 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 240216 Kinesin motor domain 240219 Glutamine synthetase, beta-Grasp domain 240219 Glutamine synthetase, catalytic domain 240222 Ribosomal protein S5, C-terminal domain 240222 Ribosomal protein S5, N-terminal domain 240232 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 240236 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 240236 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family 240239 7 transmembrane receptor (rhodopsin family) 240255 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 240255 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f 240261 Glycosyl hydrolase family 26 240261 Uncharacterized ACR, COG1579 240269 Ribosomal protein L6e 240277 Enolase, C-terminal TIM barrel domain 240284 WD domain, G-beta repeat 240289 Proteasome A-type and B-type 240303 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 240303 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 240322 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 240322 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 240322 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 240332 Sodium:neurotransmitter symporter family 240343 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 240350 Ribosomal protein S7e 240350 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 240360 Galactose-1-phosphate uridyl transferase, C-terminal domain. SCOP reports fold duplication with N-terminal domain. Both involved in Zn and Fe binding 240367 Ribosomal L29e protein family 240411 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 240444 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 240444 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 240456 Ribosomal protein L36e 240456 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 240478 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 240479 Protein kinase domain 240482 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 240490 HMG (high mobility group) box 240493 PTB domain (IRS-1 type) 240493 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 240495 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 240496 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 240496 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 240498 ENV polyprotein (coat polyprotein) 240498 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 240498 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 240516 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 240519 Microtubule associated protein (MAP65/ASE1 family) 240525 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 240527 Mitochondrial carrier protein 240531 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 240537 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 240539 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 240541 Hsp90 protein 240543 7 transmembrane receptor (rhodopsin family) 240544 7 transmembrane receptor (rhodopsin family) 240545 7 transmembrane receptor (rhodopsin family) 240551 ENV polyprotein (coat polyprotein) 240551 7 transmembrane receptor (rhodopsin family) 240551 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 240551 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 240551 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 240551 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 240569 2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HCC 240569 2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways. This family also contains members with functions other than HCCA 240582 Hyaluronidase 240590 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 240590 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 240590 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 240595 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 240595 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 240632 ab-hydrolase associated lipase region 240632 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 240633 ab-hydrolase associated lipase region 240641 Kinesin motor domain 240656 HMG (high mobility group) box 240657 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 240660 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 240660 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 240665 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi 240667 WD domain, G-beta repeat 240669 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 240670 Ribosomal protein L31e 240672 Protein-tyrosine phosphatase 240672 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 240672 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 240675 von Willebrand factor type A domain 240677 ENV polyprotein (coat polyprotein) 240677 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 240677 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 240677 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 240677 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 240679 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 240679 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 240689 Proteasome A-type and B-type 240690 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 240693 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 240693 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 240693 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 240710 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 240722 Mitochondrial carrier protein 240725 Sulfatase 240726 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 240731 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 240731 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 240752 C2 domain 240752 Phosphatidylinositol 3- and 4-kinase 240752 PX domain. PX domains bind to phosphoinositides 240752 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 240752 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 240752 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 240752 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains (unpublished observa 240753 PH domain. PH stands for pleckstrin homology 240753 PH domain. PH stands for pleckstrin homology 240756 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 240756 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 240756 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 240756 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 240761 GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 240761 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 240762 7 transmembrane receptor (rhodopsin family) 240763 Macrophage migration inhibitory factor (MIF) 240773 Sushi domain (SCR repeat) 240776 Uncharacterized protein family, UPF0065 240776 Calcium-activated BK potassium channel alpha subunit 240776 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 240779 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 240779 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 240779 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 240783 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 240798 ENV polyprotein (coat polyprotein) 240798 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 240799 Ribosomal protein S6e 240811 Ribosomal protein S5, C-terminal domain 240811 Ribosomal protein S5, N-terminal domain 240841 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 240843 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl 240867 7 transmembrane receptor (rhodopsin family) 240869 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 240869 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 240871 Cyclic nucleotide-binding domain 240871 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 240880 Protein kinase domain 240888 7 transmembrane receptor (rhodopsin family) 240891 Protein of unknown function DUF47. This family includes prokaryotic proteins of unknown function, as well as a protein annotated as the pit accessory protein from Sinorhizobium meliloti. However, the function of this protein is also unknown (Pit stands fo 240892 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 240894 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 240895 SKIP/SNW domain. This domain is found in chromatin proteins 240898 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 240903 Translation initiation factor SUI1 240913 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 240921 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 240921 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 240923 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 240957 Fatty acid desaturase 240958 Fatty acid desaturase 240958 Protein kinase domain 240961 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 240970 Ribosomal protein L21e 241003 Fibronectin type III domain 241004 Fibronectin type III domain 241005 Fibronectin type III domain 241020 Deuterolysin metalloprotease (M35) family 241025 Ribosomal protein S17 241035 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 241041 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 241042 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 241042 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 241042 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 241042 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 241051 Protein kinase domain 241053 Ribosomal protein L11, RNA binding domain 241053 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 241054 Nucleoside diphosphate kinase 241056 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 241059 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 241060 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 241060 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 241063 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 241063 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 241063 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 241070 7 transmembrane receptor (rhodopsin family) 241073 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 241074 Cyclophilin type peptidyl-prolyl cis-trans isomerase 241075 PH domain. PH stands for pleckstrin homology 241077 Ribosomal protein L31e 241079 Protein kinase domain 241080 Protein kinase domain 241090 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 241091 Mitochondrial carrier protein 241097 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 241111 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 241118 Amiloride-sensitive sodium channel 241119 SKIP/SNW domain. This domain is found in chromatin proteins 241119 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 241140 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 241151 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 241152 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 241153 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 241154 ADP-ribosylation factor family 241154 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 241157 7 transmembrane receptor (rhodopsin family) 241158 MYND finger 241158 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 241176 ENV polyprotein (coat polyprotein) 241177 Proteasome A-type and B-type 241193 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 241197 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 241201 Cadherin domain 241201 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili 241213 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 241215 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 241215 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 241217 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 241218 Microtubule associated protein (MAP65/ASE1 family) 241219 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 241219 Up-frameshift suppressor 2. Transcripts harboring premature signals for translation termination are recognized and rapidly degraded by eukaryotic cells through a pathway known as nonsense-mediated mRNA decay. In Saccharomyces cerevisiae, three trans-actin 241226 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 241228 CUB domain 241228 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 241230 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 241242 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 241242 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 241242 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 241242 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 241242 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 241263 7 transmembrane receptor (metabotropic glutamate family) 241263 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 241270 Zinc finger, C3HC4 type (RING finger) 241272 ENV polyprotein (coat polyprotein) 241294 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 241301 Cyclophilin type peptidyl-prolyl cis-trans isomerase 241308 PH domain. PH stands for pleckstrin homology 241311 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 241311 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 241320 Ribosomal protein L36e 241322 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 241324 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 241325 Thymosin beta-4 family 241327 Olfactomedin-like domain 241329 Uncharacterised protein family (UPF0108) 241329 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 241366 ENV polyprotein (coat polyprotein) 241366 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 241366 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 241366 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 241366 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 241385 Papain family cysteine protease 241390 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 241391 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 241422 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 241424 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 241432 LIM domain. This family represents two copies of the LIM structural domain 241440 ENV polyprotein (coat polyprotein) 241447 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 241452 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 241452 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 241462 Translation initiation factor SUI1 241487 Ribosomal protein S6e 241489 3'5'-cyclic nucleotide phosphodiesterase 241489 GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases 241490 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati 241504 Ribosomal L10 241506 Ribosomal protein L6 241512 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 241512 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 241515 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 241516 High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin 241521 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 241521 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 241524 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 241525 Yippee putative zinc-binding protein 241528 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 241528 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 241530 7 transmembrane receptor (rhodopsin family) 241531 7 transmembrane receptor (rhodopsin family) 241532 7 transmembrane receptor (rhodopsin family) 241536 7 transmembrane receptor (rhodopsin family) 241537 7 transmembrane receptor (rhodopsin family) 241539 7 transmembrane receptor (rhodopsin family) 241540 Homeobox domain 241552 Cyclophilin type peptidyl-prolyl cis-trans isomerase 241556 Tetraspanin family 241568 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 241568 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 241572 Pyruvate kinase, alpha/beta domain 241572 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 241572 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 241587 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 241593 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 241593 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 241612 Spore germination protein 241612 Sodium:solute symporter family 241621 Ribosomal protein L15 241625 Core histone H2A/H2B/H3/H4 241626 Transglutaminase family 241626 Transglutaminase family, C-terminal ig like domain 241626 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 241633 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 241634 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 241635 TFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits TFIIE-alpha and TFIIE-beta and joins t 241636 Transglutaminase family 241636 Transglutaminase family, C-terminal ig like domain 241636 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 241636 Transglutaminase family 241636 Transglutaminase family, C-terminal ig like domain 241636 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 241639 PH domain. PH stands for pleckstrin homology 241658 Ribosomal L10 241672 Calreticulin family 241681 Kinesin motor domain 241690 Ribosomal L10 241694 Rap/ran-GAP 241697 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 241697 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 241709 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 241714 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 241716 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 241723 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 241725 DNA gyrase B. This family represents the second domain of DNA gyrase B which has a ribosomal S5 domain 2-like fold. This family is structurally related to PF01119 241725 DNA gyrase B. This family represents the second domain of DNA gyrase B which has a ribosomal S5 domain 2-like fold. This family is structurally related to pfam01119 241725 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 241732 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 241735 MYND finger 241737 Ribosomal protein L6 241764 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 241764 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 241764 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 241765 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 241765 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 241769 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 241770 C2 domain 241786 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 241794 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 241794 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 241810 Ribosomal protein S5, N-terminal domain 241830 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 241850 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 241853 ENV polyprotein (coat polyprotein) 241853 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 241853 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 241854 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 241854 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 241854 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 241857 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 241857 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 241860 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 241860 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 241860 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 241863 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 241864 Protein kinase domain 241865 Zinc finger, C3HC4 type (RING finger) 241866 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 241866 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 241874 Cyclophilin type peptidyl-prolyl cis-trans isomerase 241877 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa 241878 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 241879 ENV polyprotein (coat polyprotein) 241882 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 241883 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 241883 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 241883 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 241890 Amino acid permease 241890 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 241890 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 241890 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 241897 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 241898 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 241899 ENV polyprotein (coat polyprotein) 241901 ENV polyprotein (coat polyprotein) 241901 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 241901 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 241915 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 241923 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 241942 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 241944 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 241950 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 241950 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 241957 Ribosomal protein L21e 241965 Cadherin domain 241965 Cadherin domain 241965 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 241967 Cyclophilin type peptidyl-prolyl cis-trans isomerase 241980 Cyclophilin type peptidyl-prolyl cis-trans isomerase 241983 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 241983 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 241983 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 241985 Pyruvate kinase, alpha/beta domain 241985 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 241989 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 241992 14-3-3 protein 241994 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 241994 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 241996 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 241996 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 242025 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 242036 Fructose-bisphosphate aldolase class-I 242037 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 242039 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 242039 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 242050 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 242050 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 242051 Poly-adenylate binding protein, unique domain 242051 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 242052 Carboxylesterase 242054 Enolase, C-terminal TIM barrel domain 242060 Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown 242061 ENV polyprotein (coat polyprotein) 242065 PH domain. PH stands for pleckstrin homology 242083 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 242084 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 242088 Cadherin domain 242093 7 transmembrane receptor (rhodopsin family) 242093 7 transmembrane receptor (rhodopsin family) 242096 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 242097 N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere 242100 N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere 242101 Ribosomal protein L21e 242107 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 242107 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 242111 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 242112 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 242112 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 242112 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 242122 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 242125 Mab-21 protein 242126 Sugar (and other) transporter 242126 Sugar (and other) transporter 242151 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 242151 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 242153 14-3-3 protein 242175 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 242175 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 242175 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 242185 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 242195 Cyclophilin type peptidyl-prolyl cis-trans isomerase 242199 Zinc-binding dehydrogenase 242200 Cyclophilin type peptidyl-prolyl cis-trans isomerase 242202 3'5'-cyclic nucleotide phosphodiesterase 242205 14-3-3 protein 242206 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 242206 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 242206 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 242220 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b 242221 Ribosomal protein L19e 242244 Cyclophilin type peptidyl-prolyl cis-trans isomerase 242257 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 242259 Domain of unknown function DUF20. This transmembrane region is found in putative permeases and predicted transmembrane proteins it has no known function. It is not clear what source suggested that these proteins may be permeases and this information shoul 242280 Protein kinase domain 242285 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 242285 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 242285 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 242286 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 242291 Inositol monophosphatase family 242311 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 242316 Transforming growth factor beta like domain 242316 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 242317 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 242317 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 242318 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 242319 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 242321 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 242339 Cyclic nucleotide-binding domain 242340 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 242341 ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae 242341 ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae 242363 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 242363 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 242363 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 242373 Ribosomal protein L23 242377 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 242384 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 242384 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 242384 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 242386 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 242388 Ribosomal protein S6e 242410 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 242410 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 242412 HMG (high mobility group) box 242425 7 transmembrane receptor (metabotropic glutamate family) 242425 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 242443 Bacterial extracellular solute-binding proteins, family 3 242443 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 242447 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 242451 NAC domain 242459 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 242474 Domain of unknown function DUF20. This transmembrane region is found in putative permeases and predicted transmembrane proteins it has no known function. It is not clear what source suggested that these proteins may be permeases and this information shoul 242481 Paralemmin 242496 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 242497 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 242499 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 242499 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 242502 Ribonucleotide reductase, small chain 242505 Uncharacterized ACR, COG1579 242505 Intermediate filament protein 242505 ADP-ribosylation factor family 242505 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 242505 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 242505 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 242505 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 242505 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 242506 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 242517 Interferon alpha/beta domain 242519 Interferon alpha/beta domain 242519 Interferon alpha/beta domain 242521 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 242523 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 242523 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 242523 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 242525 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 242536 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 242540 Ribosomal Proteins L2, C-terminal domain 242540 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 242540 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 242546 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 242555 Ribosomal family S4e 242557 Peptidase family C54 242570 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 242597 Ribosomal L29e protein family 242599 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 242603 CUB domain 242607 Sodium:dicarboxylate symporter family 242620 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 242643 Ribosomal protein L36e 242646 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 242646 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 242646 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 242651 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 242651 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 242653 PMP-22/EMP/MP20/Claudin family 242662 C2 domain 242663 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 242663 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 242665 Palmitoyl protein thioesterase 242667 Guanylate-kinase-associated protein (GKAP) protein 242669 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 242681 ADP-ribosylation factor family 242681 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 242702 Fibronectin type III domain 242702 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 242703 Core histone H2A/H2B/H3/H4 242705 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ 242705 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 242711 Trypsin 242711 Protein of unknown function DUF100. This prokaryotic family has no known function 242712 CUB domain 242721 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 242726 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L- 242733 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 242735 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 242737 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 242740 Carboxylesterase 242741 Carboxylesterase 242742 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 242744 HMG (high mobility group) box 242744 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 242745 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 242747 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 242748 AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance 242748 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 242767 Sugar (and other) transporter 242785 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 242785 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 242789 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 242789 Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int 242805 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 242805 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 242809 Ribosomal protein L3 242819 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 242823 ENV polyprotein (coat polyprotein) 242823 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 242823 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 242823 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 242825 Protein kinase domain 242825 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 242827 Ribosomal protein L21e 242842 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 242842 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 242842 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 242848 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 242851 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 242852 ADP-ribosylation factor family 242852 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 242891 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 242894 Ribosomal protein L31e 242904 Cyclophilin type peptidyl-prolyl cis-trans isomerase 242905 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 242914 Kinesin motor domain 242939 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 242940 Protein kinase domain 242940 POLO box duplicated region 242960 F-box domain 242960 F-box domain 242965 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 242987 Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site 242987 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 242987 Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found 242987 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 242997 Uncharacterized ACR, COG1579 242997 Intermediate filament protein 242997 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 242997 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 243011 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 243014 ADP-ribosylation factor family 243014 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 243016 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 243040 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 243043 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 243043 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 243044 Adenylate kinase 243044 Thymidylate kinase 243060 ADP-ribosylation factor family 243060 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 243062 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 243065 Protein kinase domain 243068 Methylenetetrahydrofolate reductase. This family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from bacteria and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The structure for this domain is known to be a TIM barre 243077 Ribosomal protein L31e 243078 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster 243083 Trypsin 243083 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 243085 UDP-glucoronosyl and UDP-glucosyl transferase 243099 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 243100 Serum albumin family 243119 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 243126 HMG (high mobility group) box 243150 Ribosomal L22e protein family 243162 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 243163 Ribosomal protein L31e 243176 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 243176 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 243176 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243183 Ribosomal L15 243187 Homeobox domain 243200 7 transmembrane receptor (metabotropic glutamate family) 243205 7 transmembrane receptor (metabotropic glutamate family) 243206 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243208 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243209 Ribosomal protein L19e 243209 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243210 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243220 Sulfotransferase protein 243220 WSC domain. This domain may be involved in carbohydrate binding 243241 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 243241 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 243245 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243246 Protein kinase domain 243261 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 243270 7 transmembrane receptor (rhodopsin family) 243277 7 transmembrane receptor (Secretin family) 243277 7 transmembrane receptor (Secretin family) 243277 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 243302 S25 ribosomal protein 243305 Glypican 243306 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 243308 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243311 HMG (high mobility group) box 243312 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 243326 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 243328 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 243332 Core histone H2A/H2B/H3/H4 243339 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 243341 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 243349 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 243352 Nucleoside diphosphate kinase 243368 7 transmembrane receptor (metabotropic glutamate family) 243368 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 243369 Thrombospondin type 1 domain 243369 von Willebrand factor type D domain 243369 Low-density lipoprotein receptor domain class A 243369 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 243369 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 243369 Thrombospondin type 1 domain 243369 Low-density lipoprotein receptor domain class A 243369 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 243369 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 243373 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 243373 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 243374 GTPase of unknown function 243375 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 243375 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 243375 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge 243376 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 243376 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 243376 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge 243377 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 243377 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou 243377 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog 243378 Prp18 domain. The splicing factor Prp18 is required for the second step of pre-mRNA splicing. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known. This fragment is fully active in yeast splicing in vitro and includes the seque 243382 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 243388 ENV polyprotein (coat polyprotein) 243407 7 transmembrane receptor (rhodopsin family) 243419 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243420 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243422 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243423 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243425 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243427 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 243428 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243430 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243431 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243433 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243434 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243436 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243437 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243438 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243439 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243440 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243442 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243444 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243445 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243446 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243448 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243449 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243450 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243455 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243457 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243458 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243459 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243460 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243461 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243462 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243463 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243464 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 243465 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243467 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243468 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243469 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243479 7 transmembrane receptor (rhodopsin family) 243484 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 243484 Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI) 243484 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 243497 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 243514 Eukaryotic ribosomal protein L18 243523 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 243529 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 243536 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 243543 Ribosomal protein L23 243547 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 243548 LIM domain. This family represents two copies of the LIM structural domain 243574 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 243574 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 243575 Elongation factor 1 gamma, conserved domain 243606 NAC domain 243612 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 243616 Sodium:neurotransmitter symporter family 243617 Ribosomal protein L15 243621 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 243628 7 transmembrane receptor (metabotropic glutamate family) 243628 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 243628 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 243628 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 243628 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 243630 Amiloride-sensitive sodium channel 243635 Cyclophilin type peptidyl-prolyl cis-trans isomerase 243642 Ribosomal protein S5, C-terminal domain 243642 Ribosomal protein S5, N-terminal domain 243645 ADP-ribosylation factor family 243645 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 243645 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 243645 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 243645 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 243645 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 243658 Lectin C-type domain. This family includes both long and short form C-type 243665 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 243665 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 243665 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 243676 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 243676 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 243677 ENV polyprotein (coat polyprotein) 243689 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 243715 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 243716 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 243728 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 243737 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 243743 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans 243743 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T 243743 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 243748 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243753 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict 243755 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 243764 7 transmembrane receptor (rhodopsin family) 243766 Transcription factor TFIIB repeat 243781 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243784 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243785 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243786 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243790 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 243790 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 243790 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 243791 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 243794 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243796 VHS domain. Domain present in VPS-27, Hrs and STAM 243801 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 243811 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 243813 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 243815 ENV polyprotein (coat polyprotein) 243815 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 243815 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 243815 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 243815 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 243821 Protein kinase domain 243823 Isochorismatase family. This family are hydrolase enzymes 243824 Protein kinase domain 243827 ENV polyprotein (coat polyprotein) 243827 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 243827 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 243827 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 243831 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243831 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 243833 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243834 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243834 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 243835 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 243836 Ribosomal protein S8e 243841 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 243841 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 243841 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 243849 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 243851 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 243858 Ribosomal L15 243863 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 243864 Class I Histocompatibility antigen, domains alpha 1 and 2 243864 Class I Histocompatibility antigen, domains alpha 1 and 2 243865 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 243865 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 243867 F-box domain 243869 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 243869 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 243869 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 243873 Ribosomal protein S19e 243874 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 243874 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 243875 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 243878 Ribosomal L15 243880 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 243881 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 243884 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243884 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 243889 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 243902 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243902 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 243905 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243906 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243906 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243908 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 243912 Hsp20/alpha crystallin family 243914 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 243914 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 243917 Ribosomal protein L23 243917 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 243918 Glycosyltransferase family 6 243918 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 243918 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 243937 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 243944 ENV polyprotein (coat polyprotein) 243948 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243951 7 transmembrane receptor (metabotropic glutamate family) 243951 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 243955 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243956 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243959 Trypsin 243961 Sterile alpha motif (SAM)/Pointed domain 243961 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 243961 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 243961 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 243961 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 243963 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243963 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 243964 Sodium:neurotransmitter symporter family 243965 Sodium:neurotransmitter symporter family 243967 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 243967 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 243968 Protein kinase domain 243973 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 243976 Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate 243978 7 transmembrane receptor (rhodopsin family) 243979 7 transmembrane receptor (rhodopsin family) 243979 7 transmembrane receptor (rhodopsin family) 243981 7 transmembrane receptor (rhodopsin family) 244000 Zinc finger, C3HC4 type (RING finger) 244000 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 244003 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 244023 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 244023 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 244023 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 244027 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 244031 ENV polyprotein (coat polyprotein) 244041 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 244041 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244049 C2 domain 244050 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 244050 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244052 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 244052 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244059 'chromo' (CHRromatin Organization MOdifier) domain 244059 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 244059 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 244059 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 244061 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 244067 PH domain. PH stands for pleckstrin homology 244067 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 244089 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 244089 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 244089 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 244090 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidine 244090 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins. One member is a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane. One member is a a Na+-dependent nucleoside transporter se 244099 HMG (high mobility group) box 244111 Translation initiation factor SUI1 244112 7 transmembrane receptor (metabotropic glutamate family) 244112 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 244114 7 transmembrane receptor (metabotropic glutamate family) 244114 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 244115 7 transmembrane receptor (metabotropic glutamate family) 244115 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 244116 7 transmembrane receptor (metabotropic glutamate family) 244116 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 244121 NAC domain 244137 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 244144 Ubiquitin carboxyl-terminal hydrolase family 2 244144 Ubiquitin carboxyl-terminal hydrolases family 2 244145 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 244153 Protein kinase domain 244153 Adenylate and Guanylate cyclase catalytic domain 244153 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 244153 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 244167 Ribosomal L15 244171 HMG (high mobility group) box 244178 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 244178 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 244179 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 244183 B-box zinc finger 244183 Zinc finger, C3HC4 type (RING finger) 244187 7 transmembrane receptor (rhodopsin family) 244195 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 244197 7 transmembrane receptor (rhodopsin family) 244198 Olfactomedin-like domain 244200 7 transmembrane receptor (rhodopsin family) 244202 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 244202 L1 transposable element 244202 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 244202 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 244202 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 244209 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 244212 Sulfotransferase protein 244214 GDP dissociation inhibitor 244218 Ciliary neurotrophic factor 244220 von Willebrand factor type A domain 244226 Ribosomal protein L31e 244229 Ribosomal protein S5, C-terminal domain 244229 Ribosomal protein S5, N-terminal domain 244232 7 transmembrane receptor (rhodopsin family) 244234 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 244236 von Willebrand factor type D domain 244236 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 244236 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 244238 7 transmembrane receptor (rhodopsin family) 244238 7 transmembrane receptor (rhodopsin family) 244239 Actin 244243 Lectin C-type domain. This family includes both long and short form C-type 244250 7 transmembrane receptor (metabotropic glutamate family) 244258 Ribosomal protein S2 244258 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 244292 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 244310 Guanylate-kinase-associated protein (GKAP) protein 244321 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 244324 CUB domain 244329 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 244344 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 244345 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 244348 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 244349 MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases 244349 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 244361 Ribosomal L29e protein family 244371 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 244373 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 244375 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 244375 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244389 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 244414 Ribosomal protein L19e 244414 Ribosomal protein L31e 244416 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, 244416 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 244417 Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis 244417 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 244421 Zinc finger, C3HC4 type (RING finger) 244421 ATP-dependent protease La (LON) domain 244422 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244428 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 244433 ENV polyprotein (coat polyprotein) 244433 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 244433 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 244433 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 244435 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 244442 HMG (high mobility group) box 244443 Disintegrin 244443 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 244443 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 244448 Zinc finger, C3HC4 type (RING finger) 244448 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 244486 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 244486 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 244486 Disintegrin 244486 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 244486 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 244487 Ribosomal protein S7e 244496 HMG (high mobility group) box 244509 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 244512 Ribosomal protein L3 244516 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 244532 Acyl-ACP thioesterase. This family consists of various acyl-acyl carrier protein (ACP) thioesterases (TE) these terminate fatty acyl group extension via hydrolyzing an acyl group on a fatty acid 244533 Possible metal-binding domain in RNase L inhibitor, RLI. Possible metal-binding domain in endoribonuclease RNase L inhibitor. Found at the N-terminal end of RNase L inhibitor proteins, adjacent to the 4Fe-4S binding domain, fer4, pfam00037. Also often fou 244537 Calx-beta domain 244555 ATP synthase subunit C 244556 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 244584 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 244585 C2 domain 244585 Intermediate filament protein 244585 Microtubule associated protein (MAP65/ASE1 family) 244585 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 244588 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 244588 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244595 Carboxylesterase 244598 Carboxylesterase 244614 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 244616 Thymidine kinase 244625 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 244628 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 244637 ENV polyprotein (coat polyprotein) 244646 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 244646 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 244650 PH domain. PH stands for pleckstrin homology 244650 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 244651 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 244653 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 244653 Ribosomal protein L31e 244668 Rap/ran-GAP 244668 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 244671 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 244671 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244677 ENV polyprotein (coat polyprotein) 244680 Fibronectin type III domain 244682 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 244683 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 244684 Ribosomal protein S5, N-terminal domain 244685 ENV polyprotein (coat polyprotein) 244685 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 244693 jmjN domain 244693 jmjC domain 244693 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 244694 jmjC domain 244698 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 244701 7 transmembrane receptor (rhodopsin family) 244710 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 244710 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 244710 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 244714 7 transmembrane receptor (rhodopsin family) 244723 Olfactomedin-like domain 244731 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 244754 HMG (high mobility group) box 244754 Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome 244757 Glycosyl hydrolases family 35 244758 ADP-ribosylation factor family 244763 Helix-turn-helix. This large family of DNA binding helix-turn helix proteins includes Cro and CI 244810 7 transmembrane receptor (rhodopsin family) 244811 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 244813 Homeobox domain 244813 Homeobox domain 244838 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 244838 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244859 Protein kinase domain 244864 Lectin C-type domain. This family includes both long and short form C-type 244869 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 244869 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 244885 SH2 domain 244886 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 244893 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 244894 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 244913 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 244917 HMG (high mobility group) box 244919 Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription 244923 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 244925 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 244925 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 244954 Trypsin 244955 Homeobox domain 244962 PX domain. PX domains bind to phosphoinositides 244967 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 244967 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 245000 Phosphatidylinositol 3- and 4-kinase 245000 FAT domain. The FAT domain is named after FRAP, ATM and TRRAP 245001 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 245007 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 245007 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 245007 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 245010 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 245020 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 245022 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 245022 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 245023 Ribosomal protein S26e 245026 von Willebrand factor type A domain 245027 von Willebrand factor type A domain 245029 ENV polyprotein (coat polyprotein) 245029 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 245029 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 245029 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 245045 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 245045 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 245047 Ribosomal S17 245049 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 245050 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 245055 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245056 Core histone H2A/H2B/H3/H4 245062 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 245062 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 245066 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 245067 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 245068 Protein kinase domain 245074 Protein kinase domain 245075 Protein kinase domain 245086 Marek's disease glycoprotein A 245088 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 245091 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 245094 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245095 ENV polyprotein (coat polyprotein) 245097 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245098 7 transmembrane receptor (metabotropic glutamate family) 245109 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 245113 7 transmembrane receptor (metabotropic glutamate family) 245118 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 245128 Amino acid permease 245137 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245143 Protein kinase domain 245143 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 245148 7 transmembrane receptor (rhodopsin family) 245155 7 transmembrane receptor (rhodopsin family) 245155 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 245161 Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins 245163 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245171 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245185 Protein kinase domain 245190 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 245193 3'5'-cyclic nucleotide phosphodiesterase 245202 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 245208 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 245211 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 245217 FAD linked oxidases, C-terminal domain. This domain has a ferredoxin-like fold 245217 pfam02913, FAD-oxidase_C, FAD linked oxidases, C-terminal domain. This domain has a ferredoxin-like fold 245238 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245239 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 245251 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 245251 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 245251 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 245253 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245258 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 245263 Homeobox domain 245266 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245269 Protein kinase domain 245278 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 245290 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 245290 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 245290 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 245293 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 245294 Defensin propeptide 245295 Skp1 family, tetramerisation domain 245299 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 245310 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 245317 Core histone H2A/H2B/H3/H4 245318 Core histone H2A/H2B/H3/H4 245328 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 245335 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 245335 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 245335 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 245347 Uncharacterised protein family (UPF0041) 245350 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 245353 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 245355 Protein kinase domain 245355 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 245357 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 245358 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 245358 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 245358 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 245358 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 245368 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 245372 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 245372 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 245372 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 245376 ENV polyprotein (coat polyprotein) 245376 LIM domain. This family represents two copies of the LIM structural domain 245376 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 245376 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 245377 PH domain. PH stands for pleckstrin homology 245380 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 245381 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 245381 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 245381 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 245391 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 245393 ENV polyprotein (coat polyprotein) 245403 WD domain, G-beta repeat 245404 WD domain, G-beta repeat 245405 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 245418 Homeobox domain 245418 Uncharacterized ACR, COG1579 245418 Pou domain - N-terminal to homeobox domain 245418 Archaeal ATPase. This family contain a conserved P-loop motif that is involved in binding ATP. This family is only found in archaebacteria and particularly in Methanococcus jannaschii that encodes sixteen members of this family 245418 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 245423 Endomembrane protein 70 245424 7 transmembrane receptor (rhodopsin family) 245436 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 245440 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 245442 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 245446 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 245446 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 245450 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 245453 Fork head domain 245456 S-adenosyl-L-homocysteine hydrolase 245456 Shikimate / quinate 5-dehydrogenase. This family contains both shikimate and quinate dehydrogenases. Shikimate 5-dehydrogenase catalyses the conversion of shikimate to 5-dehydroshikimate. This reaction is part of the shikimate pathway which is involved in 245470 EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the 245472 Synaptophysin / synaptoporin 245474 PUA domain. The PUA domain named after PseudoUridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi 245474 PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi 245474 TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s 245480 ATP synthase 245483 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 245483 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 245484 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 245494 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 245494 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 245498 Zn-finger in Ran binding protein and others 245498 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 245501 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 245502 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 245506 ATP synthase 245508 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 245515 Peptidase family M41 245515 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 245525 HSF-type DNA-binding 245526 7 transmembrane receptor (rhodopsin family) 245528 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 245528 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 245532 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 245532 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 245533 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 245537 Carboxylesterase 245541 Ribosomal protein S2 245545 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 245548 Zinc finger, C3HC4 type (RING finger) 245564 Homeobox domain 245569 Fibronectin type III domain 245569 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran 245573 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 245573 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 245574 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 245574 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 245575 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 245578 Cadherin domain 245580 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 245582 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 245586 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 245586 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 245589 ENV polyprotein (coat polyprotein) 245589 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 245589 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 245590 Ribosomal protein L15 245595 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 245596 Homeobox domain 245606 Ribosomal protein S5, C-terminal domain 245606 Ribosomal protein S5, N-terminal domain 245610 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle 245614 NAC domain 245618 Protein-tyrosine phosphatase 245619 Protein kinase domain 245623 Ribosomal protein S7e 245630 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 245638 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 245643 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 245643 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 245644 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 245644 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 245645 Ribosomal protein S7e 245650 Protein kinase domain 245650 Adenylate and Guanylate cyclase catalytic domain 245650 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 245651 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 245656 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 245660 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 245663 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 245666 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 245667 Arenavirus glycoprotein 245667 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 245668 Ubiquitin carboxyl-terminal hydrolase family 2 245668 Ubiquitin carboxyl-terminal hydrolases family 2 245668 Zn-finger in ubiquitin-hydrolases and other protein 245670 ADP-ribosylation factor family 245670 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 245676 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 245682 ENV polyprotein (coat polyprotein) 245683 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 245683 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 245684 PH domain. PH stands for pleckstrin homology 245684 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 245684 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 245685 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 245708 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 245757 Ribosomal protein S2 245802 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 245828 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 245838 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 245841 RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits 245857 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 245860 Autophagy protein Apg9. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targetin 245867 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 245880 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 245881 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 245886 Vacuolar sorting protein 9 (VPS9) domain 245897 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 245903 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 245919 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 245920 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 245921 Protein kinase domain 245921 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with 245959 Mitochondrial carrier protein 245961 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 245964 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 245965 ATP synthase, Delta/Epsilon chain, beta-sandwich domain. Part of the ATP synthase CF(1). These subunits are part of the head unit of the ATP synthase. The subunits are called delta and epsilon in human and metazoan species but in bacterial species the de 245972 ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae 245973 V-ATPase subunit C 245977 7 transmembrane receptor (Secretin family) 245977 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 245979 LIM domain. This family represents two copies of the LIM structural domain 245980 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 245980 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 245981 Ribosomal L15 246048 Lectin C-type domain. This family includes both long and short form C-type 246049 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 246060 Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase 246073 Fork head domain 246075 7 transmembrane receptor (rhodopsin family) 246076 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 246086 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 246097 Death domain 246104 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f 246115 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 246133 Slow voltage-gated potassium channel 246143 Death domain 246145 Phosphatidylethanolamine-binding protein 246149 Homeobox domain 246152 WD domain, G-beta repeat 246153 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 246153 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 246172 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 246174 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 246177 Myosin head (motor domain) 246185 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou 246211 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso 246212 Rap/ran-GAP 246214 Fibronectin type III domain 246215 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 246228 Fibronectin type III domain 246228 Fibronectin type III domain 246228 von Willebrand factor type A domain 246234 Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor 246235 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 246237 PH domain. PH stands for pleckstrin homology 246239 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 246240 Protein kinase domain 246243 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 246245 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 246247 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 246248 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 246249 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 246252 Carboxylesterase 246253 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 246257 Domain of unknown function (DUF341) 246267 Triosephosphate isomerase 246271 Ets-domain 246274 Uncharacterized protein family UPF0005 246277 Pyridoxal-dependent decarboxylase conserved domain 246280 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 246281 O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine 246293 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 246297 RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impli 246301 Helix-loop-helix DNA-binding domain 246310 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 246313 7 transmembrane receptor (rhodopsin family) 246316 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 246316 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 246317 CUB domain 246317 Low-density lipoprotein receptor domain class A 246324 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 246325 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 246325 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 246327 DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits 246328 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 246329 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 246331 CUB domain 246331 MAM domain. An extracellular domain found in many receptors 246331 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 246332 Protein kinase domain 246334 P53 246358 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 246381 Protein kinase domain 246696 Mitochondrial carrier protein 246703 YjeF-related protein N-terminus 246710 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 246721 RNA polymerases L / 13 to 16 kDa subunit 246721 RNA polymerases L / 13 to 16 kDa subunit 246721 RNA polymerases L / 13 to 16 kDa subunit 246757 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 246757 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 246759 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 246760 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 246766 Glycosyltransferase family 6 246767 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 246770 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 246771 Mitochondrial carrier protein 246787 Sodium:solute symporter family 246788 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 246791 Homeobox domain 246792 Homeobox domain 252829 Homeobox domain 252830 Homeobox domain 252833 Transforming growth factor beta like domain 252833 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 252834 Transforming growth factor beta like domain 252834 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 252837 7 transmembrane receptor (rhodopsin family) 252837 7 transmembrane receptor (rhodopsin family) 252837 Thiamine biosynthesis protein (ThiI). ThiI is required for thiazole synthesis, required for thiamine biosynthesis 252856 Helix-loop-helix DNA-binding domain 252859 7 transmembrane receptor (rhodopsin family) 252859 7 transmembrane receptor (rhodopsin family) 252868 PMP-22/EMP/MP20/Claudin family 252870 Ubiquitin carboxyl-terminal hydrolase family 2 252870 Ubiquitin carboxyl-terminal hydrolases family 2 252870 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 252876 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic d 252880 OAR domain 252880 Homeobox domain 252882 MAM domain. An extracellular domain found in many receptors 252885 Homeobox domain 252886 Fork head domain 252888 Cadherin domain 252892 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 252892 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 252894 Cadherin domain 252895 Cadherin domain 252898 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 252904 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 252905 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 252906 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 252907 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 252908 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 252909 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 252910 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 252912 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 252917 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 252917 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 252919 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 252922 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 252922 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 252923 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 252923 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 252923 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 252924 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 252924 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 252926 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 252926 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 252927 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 252928 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb 252931 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 252934 Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va 252939 7 transmembrane receptor (rhodopsin family) 252941 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 252942 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 252960 7 transmembrane receptor (rhodopsin family) 252972 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 252974 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, 252995 Fibronectin type III domain 253012 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 253013 Ribosomal protein L31e 253017 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero 253026 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 253141 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 253141 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 253228 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 253251 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 253272 Zinc finger, C3HC4 type (RING finger) 253289 Regulatory subunit of type II PKA R-subunit 253314 Eukaryotic initiation factor 4E 253323 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 253342 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 253387 Ribosomal protein L3 253430 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 253482 Ribosomal protein S6e 253512 Mitochondrial carrier protein 253558 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 253558 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 253559 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 253567 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 253659 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 253659 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 253665 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 253738 Fimbrial Usher protein. This protein is involved in biogenesis of gram negative bacterial pili 253738 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 253769 WD domain, G-beta repeat 253782 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 253825 Ribosomal protein L21e 253827 SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad 253839 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 253853 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 253853 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 253924 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 253933 7 transmembrane receptor (rhodopsin family) 253935 Fibrinogen beta and gamma chains, C-terminal globular domain 253936 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 253936 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 253943 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment 253947 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 253957 Ribosomal S3Ae family 253959 Rap/ran-GAP 253966 Ribosomal protein L21e 253980 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 253980 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 253986 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 254042 metallopeptidase family M24 254049 Actin 254088 Intermediate filament protein 254102 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 254109 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 254111 Intermediate filament protein 254116 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 254122 PX domain. PX domains bind to phosphoinositides 254145 Actin 254173 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 254240 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 254263 Cornichon protein 254263 Cornichon protein 254272 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 254287 Ribosomal protein L31e 254359 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 254394 MCM2/3/5 family 254395 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 254428 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha 254552 NUDIX domain 254552 NUDIX domain 254556 Zinc finger, C3HC4 type (RING finger) 254571 Kinesin motor domain 254573 Core histone H2A/H2B/H3/H4 254620 Intermediate filament protein 254665 Protein kinase domain 254700 Ubiquitin carboxyl-terminal hydrolase family 2 254705 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 254773 Transglycosylase SLT domain. This family is distantly related to pfam00062 254783 7 transmembrane receptor (rhodopsin family) 254784 7 transmembrane receptor (rhodopsin family) 254786 7 transmembrane receptor (rhodopsin family) 254787 7 transmembrane receptor (rhodopsin family) 254791 Intermediate filament protein 254863 Protein of unknown function (DUF423). Potential integral membrane protein 254879 7 transmembrane receptor (rhodopsin family) 254953 PH domain. PH stands for pleckstrin homology 254953 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 254953 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 254954 Ribosomal protein L6e 254973 7 transmembrane receptor (rhodopsin family) 254974 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 255173 Ribosomal protein S7e 255189 C2 domain 255189 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 255220 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 255239 Protein kinase domain 255239 Protein kinase domain 255239 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 255240 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 255290 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 255308 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 255308 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 255319 PH domain. PH stands for pleckstrin homology 255319 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 255319 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 255332 ADP-ribosylation factor family 255374 Metallo-beta-lactamase superfamily 255395 Ribosomal protein S21e 255447 Actin 255488 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 255519 HMG (high mobility group) box 255576 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 255576 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 255608 Ubiquitin carboxyl-terminal hydrolase family 2 255608 Ubiquitin carboxyl-terminal hydrolases family 2 255609 Ubiquitin carboxyl-terminal hydrolase family 2 255609 Ubiquitin carboxyl-terminal hydrolases family 2 255610 Ubiquitin carboxyl-terminal hydrolase family 2 255610 Ubiquitin carboxyl-terminal hydrolases family 2 255631 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 255681 Ribosomal protein L31e 255701 Ribosomal protein L44 255713 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 255723 7 transmembrane receptor (rhodopsin family) 255725 7 transmembrane receptor (rhodopsin family) 255726 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 255726 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 255731 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 255738 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con 255743 MAM domain. An extracellular domain found in many receptors 255758 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 255761 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 255777 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 255812 FAD binding domain. This family includes members that bind FAD. This family includes the flavoprotein subunits from succinate and fumarate dehydrogenase, aspartate oxidase and the alpha subunit of adenylylsulfate reductase 255814 Ribosomal S17 255870 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 255877 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 255877 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 255896 Radical SAM superfamily 255896 MoaC family. Members of this family are involved in molybdenum cofactor biosynthesis. However their molecular function is not known 255926 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 255928 C2 domain 255997 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 256000 ADP-ribosylation factor family 256000 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 256006 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 256042 7 transmembrane receptor (rhodopsin family) 256051 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 256055 Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 256143 7 transmembrane receptor (rhodopsin family) 256144 7 transmembrane receptor (rhodopsin family) 256148 7 transmembrane receptor (rhodopsin family) 256158 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 256190 7 transmembrane receptor (rhodopsin family) 256233 Eukaryotic ribosomal protein L18 256248 Adenylate kinase 256281 NUDIX domain 256297 Helix-loop-helix DNA-binding domain 256329 Intermediate filament tail domain 256355 Ribosomal protein S5, N-terminal domain 256364 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes 256368 LIM domain. This family represents two copies of the LIM structural domain 256372 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 256374 Cyclophilin type peptidyl-prolyl cis-trans isomerase 256380 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 256394 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 256401 Protein-tyrosine phosphatase 256401 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 256435 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 256441 Ribosomal L22e protein family 256447 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 256457 Ribosomal protein L21e 256470 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 256471 Sugar (and other) transporter 256536 FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions 256568 XRCC1 N terminal domain 256576 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 256597 7 transmembrane receptor (rhodopsin family) 256691 MAM domain. An extracellular domain found in many receptors 256755 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 256892 7 transmembrane receptor (rhodopsin family) 256949 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 256970 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 256987 TMS membrane protein/tumour differentially expressed protein (TDE) 256987 TMS membrane protein/tumour differentially expressed protein (TDE) 257019 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 257022 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 257039 Ribosomal S17 257054 pfam02913, FAD-oxidase_C, FAD linked oxidases, C-terminal domain. This domain has a ferredoxin-like fold 257067 14-3-3 protein 257068 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 257079 Glutaredoxin 257089 7 transmembrane receptor (rhodopsin family) 257101 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 257106 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 257137 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 257202 Glutathione peroxidase 257202 Glutathione peroxidase 257210 Cyclophilin type peptidyl-prolyl cis-trans isomerase 257218 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 257218 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 257240 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 257240 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 257340 Trehalase 257364 PX domain. PX domains bind to phosphoinositides 257397 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 257408 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 257408 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 257632 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 257632 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 257634 Homeobox domain 257634 LIM domain. This family represents two copies of the LIM structural domain 257643 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 257646 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 257647 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 257648 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 257663 7 transmembrane receptor (rhodopsin family) 257665 7 transmembrane receptor (rhodopsin family) 257667 7 transmembrane receptor (rhodopsin family) 257913 7 transmembrane receptor (rhodopsin family) 258266 7 transmembrane receptor (rhodopsin family) 258267 7 transmembrane receptor (rhodopsin family) 258268 7 transmembrane receptor (rhodopsin family) 258269 7 transmembrane receptor (rhodopsin family) 258270 7 transmembrane receptor (rhodopsin family) 258271 7 transmembrane receptor (rhodopsin family) 258272 7 transmembrane receptor (rhodopsin family) 258273 7 transmembrane receptor (rhodopsin family) 258274 7 transmembrane receptor (rhodopsin family) 258275 7 transmembrane receptor (rhodopsin family) 258276 7 transmembrane receptor (rhodopsin family) 258277 7 transmembrane receptor (rhodopsin family) 258278 7 transmembrane receptor (rhodopsin family) 258279 7 transmembrane receptor (rhodopsin family) 258280 7 transmembrane receptor (rhodopsin family) 258281 7 transmembrane receptor (rhodopsin family) 258282 7 transmembrane receptor (rhodopsin family) 258283 7 transmembrane receptor (rhodopsin family) 258284 7 transmembrane receptor (rhodopsin family) 258285 7 transmembrane receptor (rhodopsin family) 258286 7 transmembrane receptor (rhodopsin family) 258287 7 transmembrane receptor (rhodopsin family) 258288 7 transmembrane receptor (rhodopsin family) 258289 7 transmembrane receptor (rhodopsin family) 258290 7 transmembrane receptor (rhodopsin family) 258291 7 transmembrane receptor (rhodopsin family) 258292 7 transmembrane receptor (rhodopsin family) 258293 7 transmembrane receptor (rhodopsin family) 258294 7 transmembrane receptor (rhodopsin family) 258295 7 transmembrane receptor (rhodopsin family) 258296 7 transmembrane receptor (rhodopsin family) 258297 7 transmembrane receptor (rhodopsin family) 258298 7 transmembrane receptor (rhodopsin family) 258300 7 transmembrane receptor (rhodopsin family) 258301 7 transmembrane receptor (rhodopsin family) 258302 7 transmembrane receptor (rhodopsin family) 258303 7 transmembrane receptor (rhodopsin family) 258304 7 transmembrane receptor (rhodopsin family) 258305 7 transmembrane receptor (rhodopsin family) 258306 7 transmembrane receptor (rhodopsin family) 258307 7 transmembrane receptor (rhodopsin family) 258308 7 transmembrane receptor (rhodopsin family) 258309 7 transmembrane receptor (rhodopsin family) 258311 7 transmembrane receptor (rhodopsin family) 258313 7 transmembrane receptor (rhodopsin family) 258314 7 transmembrane receptor (rhodopsin family) 258315 7 transmembrane receptor (rhodopsin family) 258316 7 transmembrane receptor (rhodopsin family) 258317 7 transmembrane receptor (rhodopsin family) 258318 7 transmembrane receptor (rhodopsin family) 258319 7 transmembrane receptor (rhodopsin family) 258320 7 transmembrane receptor (rhodopsin family) 258321 7 transmembrane receptor (rhodopsin family) 258322 7 transmembrane receptor (rhodopsin family) 258323 7 transmembrane receptor (rhodopsin family) 258324 7 transmembrane receptor (rhodopsin family) 258325 7 transmembrane receptor (rhodopsin family) 258326 7 transmembrane receptor (rhodopsin family) 258327 7 transmembrane receptor (rhodopsin family) 258328 7 transmembrane receptor (rhodopsin family) 258329 7 transmembrane receptor (rhodopsin family) 258330 7 transmembrane receptor (rhodopsin family) 258331 7 transmembrane receptor (rhodopsin family) 258333 7 transmembrane receptor (rhodopsin family) 258334 7 transmembrane receptor (rhodopsin family) 258335 7 transmembrane receptor (rhodopsin family) 258336 7 transmembrane receptor (rhodopsin family) 258338 7 transmembrane receptor (rhodopsin family) 258339 7 transmembrane receptor (rhodopsin family) 258340 7 transmembrane receptor (rhodopsin family) 258341 7 transmembrane receptor (rhodopsin family) 258342 7 transmembrane receptor (rhodopsin family) 258343 7 transmembrane receptor (rhodopsin family) 258344 7 transmembrane receptor (rhodopsin family) 258345 7 transmembrane receptor (rhodopsin family) 258346 7 transmembrane receptor (rhodopsin family) 258347 7 transmembrane receptor (rhodopsin family) 258348 7 transmembrane receptor (rhodopsin family) 258350 7 transmembrane receptor (rhodopsin family) 258351 7 transmembrane receptor (rhodopsin family) 258352 7 transmembrane receptor (rhodopsin family) 258353 7 transmembrane receptor (rhodopsin family) 258354 7 transmembrane receptor (rhodopsin family) 258355 7 transmembrane receptor (rhodopsin family) 258356 7 transmembrane receptor (rhodopsin family) 258357 7 transmembrane receptor (rhodopsin family) 258358 7 transmembrane receptor (rhodopsin family) 258359 7 transmembrane receptor (rhodopsin family) 258360 7 transmembrane receptor (rhodopsin family) 258361 7 transmembrane receptor (rhodopsin family) 258362 7 transmembrane receptor (rhodopsin family) 258363 7 transmembrane receptor (rhodopsin family) 258364 7 transmembrane receptor (rhodopsin family) 258365 7 transmembrane receptor (rhodopsin family) 258366 7 transmembrane receptor (rhodopsin family) 258368 7 transmembrane receptor (rhodopsin family) 258369 7 transmembrane receptor (rhodopsin family) 258370 7 transmembrane receptor (rhodopsin family) 258371 7 transmembrane receptor (rhodopsin family) 258372 7 transmembrane receptor (rhodopsin family) 258373 7 transmembrane receptor (rhodopsin family) 258374 7 transmembrane receptor (rhodopsin family) 258375 7 transmembrane receptor (rhodopsin family) 258377 7 transmembrane receptor (rhodopsin family) 258378 7 transmembrane receptor (rhodopsin family) 258379 7 transmembrane receptor (rhodopsin family) 258380 7 transmembrane receptor (rhodopsin family) 258381 7 transmembrane receptor (rhodopsin family) 258382 7 transmembrane receptor (rhodopsin family) 258383 7 transmembrane receptor (rhodopsin family) 258384 7 transmembrane receptor (rhodopsin family) 258385 7 transmembrane receptor (rhodopsin family) 258385 7 transmembrane receptor (rhodopsin family) 258386 7 transmembrane receptor (rhodopsin family) 258387 7 transmembrane receptor (rhodopsin family) 258387 7 transmembrane receptor (rhodopsin family) 258388 7 transmembrane receptor (rhodopsin family) 258389 7 transmembrane receptor (rhodopsin family) 258390 7 transmembrane receptor (rhodopsin family) 258391 7 transmembrane receptor (rhodopsin family) 258392 7 transmembrane receptor (rhodopsin family) 258393 7 transmembrane receptor (rhodopsin family) 258394 7 transmembrane receptor (rhodopsin family) 258395 7 transmembrane receptor (rhodopsin family) 258396 7 transmembrane receptor (rhodopsin family) 258397 7 transmembrane receptor (rhodopsin family) 258398 7 transmembrane receptor (rhodopsin family) 258399 7 transmembrane receptor (rhodopsin family) 258400 7 transmembrane receptor (rhodopsin family) 258401 7 transmembrane receptor (rhodopsin family) 258402 7 transmembrane receptor (rhodopsin family) 258404 7 transmembrane receptor (rhodopsin family) 258405 7 transmembrane receptor (rhodopsin family) 258406 7 transmembrane receptor (rhodopsin family) 258407 7 transmembrane receptor (rhodopsin family) 258408 7 transmembrane receptor (rhodopsin family) 258409 7 transmembrane receptor (rhodopsin family) 258410 7 transmembrane receptor (rhodopsin family) 258410 7 transmembrane receptor (rhodopsin family) 258411 7 transmembrane receptor (rhodopsin family) 258412 7 transmembrane receptor (rhodopsin family) 258412 7 transmembrane receptor (rhodopsin family) 258413 7 transmembrane receptor (rhodopsin family) 258414 7 transmembrane receptor (rhodopsin family) 258415 7 transmembrane receptor (rhodopsin family) 258415 7 transmembrane receptor (rhodopsin family) 258416 7 transmembrane receptor (rhodopsin family) 258417 7 transmembrane receptor (rhodopsin family) 258418 7 transmembrane receptor (rhodopsin family) 258418 7 transmembrane receptor (rhodopsin family) 258419 7 transmembrane receptor (rhodopsin family) 258420 7 transmembrane receptor (rhodopsin family) 258421 7 transmembrane receptor (rhodopsin family) 258422 7 transmembrane receptor (rhodopsin family) 258423 7 transmembrane receptor (rhodopsin family) 258424 7 transmembrane receptor (rhodopsin family) 258425 7 transmembrane receptor (rhodopsin family) 258426 7 transmembrane receptor (rhodopsin family) 258427 7 transmembrane receptor (rhodopsin family) 258428 7 transmembrane receptor (rhodopsin family) 258429 7 transmembrane receptor (rhodopsin family) 258430 7 transmembrane receptor (rhodopsin family) 258431 7 transmembrane receptor (rhodopsin family) 258432 7 transmembrane receptor (rhodopsin family) 258433 7 transmembrane receptor (rhodopsin family) 258434 7 transmembrane receptor (rhodopsin family) 258435 7 transmembrane receptor (rhodopsin family) 258436 7 transmembrane receptor (rhodopsin family) 258436 7 transmembrane receptor (rhodopsin family) 258437 7 transmembrane receptor (rhodopsin family) 258438 7 transmembrane receptor (rhodopsin family) 258439 7 transmembrane receptor (rhodopsin family) 258440 7 transmembrane receptor (rhodopsin family) 258441 7 transmembrane receptor (rhodopsin family) 258442 7 transmembrane receptor (rhodopsin family) 258442 7 transmembrane receptor (rhodopsin family) 258443 7 transmembrane receptor (rhodopsin family) 258444 7 transmembrane receptor (rhodopsin family) 258444 7 transmembrane receptor (rhodopsin family) 258445 7 transmembrane receptor (rhodopsin family) 258445 7 transmembrane receptor (rhodopsin family) 258445 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 258445 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 258445 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 258446 7 transmembrane receptor (rhodopsin family) 258447 7 transmembrane receptor (rhodopsin family) 258448 7 transmembrane receptor (rhodopsin family) 258449 7 transmembrane receptor (rhodopsin family) 258450 7 transmembrane receptor (rhodopsin family) 258451 7 transmembrane receptor (rhodopsin family) 258453 7 transmembrane receptor (rhodopsin family) 258454 7 transmembrane receptor (rhodopsin family) 258455 7 transmembrane receptor (rhodopsin family) 258456 7 transmembrane receptor (rhodopsin family) 258457 7 transmembrane receptor (rhodopsin family) 258458 7 transmembrane receptor (rhodopsin family) 258459 7 transmembrane receptor (rhodopsin family) 258460 7 transmembrane receptor (rhodopsin family) 258462 7 transmembrane receptor (rhodopsin family) 258463 7 transmembrane receptor (rhodopsin family) 258464 7 transmembrane receptor (rhodopsin family) 258465 7 transmembrane receptor (rhodopsin family) 258465 7 transmembrane receptor (rhodopsin family) 258466 7 transmembrane receptor (rhodopsin family) 258468 7 transmembrane receptor (rhodopsin family) 258469 7 transmembrane receptor (rhodopsin family) 258470 7 transmembrane receptor (rhodopsin family) 258471 7 transmembrane receptor (rhodopsin family) 258472 7 transmembrane receptor (rhodopsin family) 258472 7 transmembrane receptor (rhodopsin family) 258473 7 transmembrane receptor (rhodopsin family) 258474 7 transmembrane receptor (rhodopsin family) 258475 7 transmembrane receptor (rhodopsin family) 258477 7 transmembrane receptor (rhodopsin family) 258478 7 transmembrane receptor (rhodopsin family) 258479 7 transmembrane receptor (rhodopsin family) 258480 7 transmembrane receptor (rhodopsin family) 258481 7 transmembrane receptor (rhodopsin family) 258482 7 transmembrane receptor (rhodopsin family) 258483 7 transmembrane receptor (rhodopsin family) 258484 7 transmembrane receptor (rhodopsin family) 258484 7 transmembrane receptor (rhodopsin family) 258485 7 transmembrane receptor (rhodopsin family) 258486 7 transmembrane receptor (rhodopsin family) 258487 7 transmembrane receptor (rhodopsin family) 258488 7 transmembrane receptor (rhodopsin family) 258488 7 transmembrane receptor (rhodopsin family) 258489 7 transmembrane receptor (rhodopsin family) 258490 7 transmembrane receptor (rhodopsin family) 258491 7 transmembrane receptor (rhodopsin family) 258492 7 transmembrane receptor (rhodopsin family) 258493 7 transmembrane receptor (rhodopsin family) 258494 7 transmembrane receptor (rhodopsin family) 258495 7 transmembrane receptor (rhodopsin family) 258497 7 transmembrane receptor (rhodopsin family) 258498 7 transmembrane receptor (rhodopsin family) 258499 7 transmembrane receptor (rhodopsin family) 258499 7 transmembrane receptor (rhodopsin family) 258500 7 transmembrane receptor (rhodopsin family) 258500 7 transmembrane receptor (rhodopsin family) 258501 7 transmembrane receptor (rhodopsin family) 258502 7 transmembrane receptor (rhodopsin family) 258503 7 transmembrane receptor (rhodopsin family) 258504 7 transmembrane receptor (rhodopsin family) 258505 7 transmembrane receptor (rhodopsin family) 258506 7 transmembrane receptor (rhodopsin family) 258507 7 transmembrane receptor (rhodopsin family) 258508 7 transmembrane receptor (rhodopsin family) 258509 7 transmembrane receptor (rhodopsin family) 258509 7 transmembrane receptor (rhodopsin family) 258511 7 transmembrane receptor (rhodopsin family) 258511 7 transmembrane receptor (rhodopsin family) 258512 7 transmembrane receptor (rhodopsin family) 258513 7 transmembrane receptor (rhodopsin family) 258513 7 transmembrane receptor (rhodopsin family) 258515 7 transmembrane receptor (rhodopsin family) 258515 7 transmembrane receptor (rhodopsin family) 258516 7 transmembrane receptor (rhodopsin family) 258516 7 transmembrane receptor (rhodopsin family) 258517 7 transmembrane receptor (rhodopsin family) 258517 7 transmembrane receptor (rhodopsin family) 258518 7 transmembrane receptor (rhodopsin family) 258518 7 transmembrane receptor (rhodopsin family) 258519 7 transmembrane receptor (rhodopsin family) 258520 7 transmembrane receptor (rhodopsin family) 258521 7 transmembrane receptor (rhodopsin family) 258522 7 transmembrane receptor (rhodopsin family) 258523 7 transmembrane receptor (rhodopsin family) 258524 7 transmembrane receptor (rhodopsin family) 258525 7 transmembrane receptor (rhodopsin family) 258526 7 transmembrane receptor (rhodopsin family) 258527 7 transmembrane receptor (rhodopsin family) 258528 7 transmembrane receptor (rhodopsin family) 258529 7 transmembrane receptor (rhodopsin family) 258530 7 transmembrane receptor (rhodopsin family) 258531 7 transmembrane receptor (rhodopsin family) 258532 7 transmembrane receptor (rhodopsin family) 258533 7 transmembrane receptor (rhodopsin family) 258534 7 transmembrane receptor (rhodopsin family) 258536 7 transmembrane receptor (rhodopsin family) 258537 7 transmembrane receptor (rhodopsin family) 258538 7 transmembrane receptor (rhodopsin family) 258539 7 transmembrane receptor (rhodopsin family) 258539 7 transmembrane receptor (rhodopsin family) 258540 7 transmembrane receptor (rhodopsin family) 258541 7 transmembrane receptor (rhodopsin family) 258542 7 transmembrane receptor (rhodopsin family) 258543 7 transmembrane receptor (rhodopsin family) 258544 7 transmembrane receptor (rhodopsin family) 258545 7 transmembrane receptor (rhodopsin family) 258546 7 transmembrane receptor (rhodopsin family) 258547 7 transmembrane receptor (rhodopsin family) 258548 7 transmembrane receptor (rhodopsin family) 258549 7 transmembrane receptor (rhodopsin family) 258551 7 transmembrane receptor (rhodopsin family) 258552 7 transmembrane receptor (rhodopsin family) 258555 7 transmembrane receptor (rhodopsin family) 258557 7 transmembrane receptor (rhodopsin family) 258558 7 transmembrane receptor (rhodopsin family) 258559 7 transmembrane receptor (rhodopsin family) 258560 7 transmembrane receptor (rhodopsin family) 258561 7 transmembrane receptor (rhodopsin family) 258562 7 transmembrane receptor (rhodopsin family) 258563 7 transmembrane receptor (rhodopsin family) 258564 7 transmembrane receptor (rhodopsin family) 258565 7 transmembrane receptor (rhodopsin family) 258566 7 transmembrane receptor (rhodopsin family) 258568 7 transmembrane receptor (rhodopsin family) 258569 7 transmembrane receptor (rhodopsin family) 258570 7 transmembrane receptor (rhodopsin family) 258571 7 transmembrane receptor (rhodopsin family) 258572 7 transmembrane receptor (rhodopsin family) 258573 7 transmembrane receptor (rhodopsin family) 258574 7 transmembrane receptor (rhodopsin family) 258575 7 transmembrane receptor (rhodopsin family) 258577 7 transmembrane receptor (rhodopsin family) 258578 7 transmembrane receptor (rhodopsin family) 258579 7 transmembrane receptor (rhodopsin family) 258580 7 transmembrane receptor (rhodopsin family) 258581 7 transmembrane receptor (rhodopsin family) 258582 7 transmembrane receptor (rhodopsin family) 258583 7 transmembrane receptor (rhodopsin family) 258584 7 transmembrane receptor (rhodopsin family) 258585 7 transmembrane receptor (rhodopsin family) 258586 7 transmembrane receptor (rhodopsin family) 258587 7 transmembrane receptor (rhodopsin family) 258589 7 transmembrane receptor (rhodopsin family) 258590 7 transmembrane receptor (rhodopsin family) 258591 7 transmembrane receptor (rhodopsin family) 258592 7 transmembrane receptor (rhodopsin family) 258593 7 transmembrane receptor (rhodopsin family) 258594 7 transmembrane receptor (rhodopsin family) 258595 7 transmembrane receptor (rhodopsin family) 258597 7 transmembrane receptor (rhodopsin family) 258598 7 transmembrane receptor (rhodopsin family) 258600 7 transmembrane receptor (rhodopsin family) 258601 7 transmembrane receptor (rhodopsin family) 258602 7 transmembrane receptor (rhodopsin family) 258603 7 transmembrane receptor (rhodopsin family) 258603 7 transmembrane receptor (rhodopsin family) 258604 7 transmembrane receptor (rhodopsin family) 258604 7 transmembrane receptor (rhodopsin family) 258605 7 transmembrane receptor (rhodopsin family) 258606 7 transmembrane receptor (rhodopsin family) 258607 7 transmembrane receptor (rhodopsin family) 258608 7 transmembrane receptor (rhodopsin family) 258609 7 transmembrane receptor (rhodopsin family) 258609 7 transmembrane receptor (rhodopsin family) 258610 7 transmembrane receptor (rhodopsin family) 258611 7 transmembrane receptor (rhodopsin family) 258611 7 transmembrane receptor (rhodopsin family) 258612 7 transmembrane receptor (rhodopsin family) 258612 7 transmembrane receptor (rhodopsin family) 258613 7 transmembrane receptor (rhodopsin family) 258614 7 transmembrane receptor (rhodopsin family) 258615 7 transmembrane receptor (rhodopsin family) 258616 7 transmembrane receptor (rhodopsin family) 258617 7 transmembrane receptor (rhodopsin family) 258618 7 transmembrane receptor (rhodopsin family) 258620 7 transmembrane receptor (rhodopsin family) 258621 7 transmembrane receptor (rhodopsin family) 258623 7 transmembrane receptor (rhodopsin family) 258624 7 transmembrane receptor (rhodopsin family) 258625 7 transmembrane receptor (rhodopsin family) 258626 7 transmembrane receptor (rhodopsin family) 258627 7 transmembrane receptor (rhodopsin family) 258628 7 transmembrane receptor (rhodopsin family) 258629 7 transmembrane receptor (rhodopsin family) 258630 7 transmembrane receptor (rhodopsin family) 258631 7 transmembrane receptor (rhodopsin family) 258632 7 transmembrane receptor (rhodopsin family) 258633 7 transmembrane receptor (rhodopsin family) 258634 7 transmembrane receptor (rhodopsin family) 258635 7 transmembrane receptor (rhodopsin family) 258636 7 transmembrane receptor (rhodopsin family) 258638 7 transmembrane receptor (rhodopsin family) 258639 7 transmembrane receptor (rhodopsin family) 258640 7 transmembrane receptor (rhodopsin family) 258641 7 transmembrane receptor (rhodopsin family) 258642 7 transmembrane receptor (rhodopsin family) 258643 7 transmembrane receptor (rhodopsin family) 258644 7 transmembrane receptor (rhodopsin family) 258645 7 transmembrane receptor (rhodopsin family) 258647 7 transmembrane receptor (rhodopsin family) 258647 7 transmembrane receptor (rhodopsin family) 258648 7 transmembrane receptor (rhodopsin family) 258649 7 transmembrane receptor (rhodopsin family) 258649 7 transmembrane receptor (rhodopsin family) 258650 7 transmembrane receptor (rhodopsin family) 258650 7 transmembrane receptor (rhodopsin family) 258651 7 transmembrane receptor (rhodopsin family) 258652 7 transmembrane receptor (rhodopsin family) 258653 7 transmembrane receptor (rhodopsin family) 258654 7 transmembrane receptor (rhodopsin family) 258655 7 transmembrane receptor (rhodopsin family) 258656 7 transmembrane receptor (rhodopsin family) 258657 7 transmembrane receptor (rhodopsin family) 258658 7 transmembrane receptor (rhodopsin family) 258659 7 transmembrane receptor (rhodopsin family) 258660 7 transmembrane receptor (rhodopsin family) 258661 7 transmembrane receptor (rhodopsin family) 258661 7 transmembrane receptor (rhodopsin family) 258661 7TM chemoreceptor. This large family of proteins are related to pfam00001. They are 7 transmembrane receptors. This family does not include all known members, as there are problems with overlapping specificity with pfam00001. This family is greatly expand 258662 7 transmembrane receptor (rhodopsin family) 258663 7 transmembrane receptor (rhodopsin family) 258665 7 transmembrane receptor (rhodopsin family) 258666 7 transmembrane receptor (rhodopsin family) 258667 7 transmembrane receptor (rhodopsin family) 258668 7 transmembrane receptor (rhodopsin family) 258669 7 transmembrane receptor (rhodopsin family) 258670 7 transmembrane receptor (rhodopsin family) 258671 7 transmembrane receptor (rhodopsin family) 258672 7 transmembrane receptor (rhodopsin family) 258673 7 transmembrane receptor (rhodopsin family) 258674 7 transmembrane receptor (rhodopsin family) 258675 7 transmembrane receptor (rhodopsin family) 258676 7 transmembrane receptor (rhodopsin family) 258677 7 transmembrane receptor (rhodopsin family) 258678 7 transmembrane receptor (rhodopsin family) 258679 7 transmembrane receptor (rhodopsin family) 258680 7 transmembrane receptor (rhodopsin family) 258681 7 transmembrane receptor (rhodopsin family) 258682 7 transmembrane receptor (rhodopsin family) 258683 7 transmembrane receptor (rhodopsin family) 258684 7 transmembrane receptor (rhodopsin family) 258685 7 transmembrane receptor (rhodopsin family) 258686 7 transmembrane receptor (rhodopsin family) 258687 7 transmembrane receptor (rhodopsin family) 258688 7 transmembrane receptor (rhodopsin family) 258689 7 transmembrane receptor (rhodopsin family) 258690 7 transmembrane receptor (rhodopsin family) 258691 7 transmembrane receptor (rhodopsin family) 258692 7 transmembrane receptor (rhodopsin family) 258693 7 transmembrane receptor (rhodopsin family) 258694 7 transmembrane receptor (rhodopsin family) 258695 7 transmembrane receptor (rhodopsin family) 258696 7 transmembrane receptor (rhodopsin family) 258697 7 transmembrane receptor (rhodopsin family) 258698 7 transmembrane receptor (rhodopsin family) 258699 7 transmembrane receptor (rhodopsin family) 258700 7 transmembrane receptor (rhodopsin family) 258701 7 transmembrane receptor (rhodopsin family) 258702 7 transmembrane receptor (rhodopsin family) 258703 7 transmembrane receptor (rhodopsin family) 258704 7 transmembrane receptor (rhodopsin family) 258705 7 transmembrane receptor (rhodopsin family) 258706 7 transmembrane receptor (rhodopsin family) 258707 7 transmembrane receptor (rhodopsin family) 258708 7 transmembrane receptor (rhodopsin family) 258709 7 transmembrane receptor (rhodopsin family) 258710 7 transmembrane receptor (rhodopsin family) 258711 7 transmembrane receptor (rhodopsin family) 258712 7 transmembrane receptor (rhodopsin family) 258712 7 transmembrane receptor (rhodopsin family) 258713 7 transmembrane receptor (rhodopsin family) 258714 7 transmembrane receptor (rhodopsin family) 258715 7 transmembrane receptor (rhodopsin family) 258716 7 transmembrane receptor (rhodopsin family) 258717 7 transmembrane receptor (rhodopsin family) 258718 7 transmembrane receptor (rhodopsin family) 258719 7 transmembrane receptor (rhodopsin family) 258720 7 transmembrane receptor (rhodopsin family) 258721 7 transmembrane receptor (rhodopsin family) 258722 7 transmembrane receptor (rhodopsin family) 258723 7 transmembrane receptor (rhodopsin family) 258723 7 transmembrane receptor (rhodopsin family) 258724 7 transmembrane receptor (rhodopsin family) 258725 7 transmembrane receptor (rhodopsin family) 258726 7 transmembrane receptor (rhodopsin family) 258727 7 transmembrane receptor (rhodopsin family) 258728 7 transmembrane receptor (rhodopsin family) 258729 7 transmembrane receptor (rhodopsin family) 258730 7 transmembrane receptor (rhodopsin family) 258731 7 transmembrane receptor (rhodopsin family) 258732 7 transmembrane receptor (rhodopsin family) 258733 7 transmembrane receptor (rhodopsin family) 258734 7 transmembrane receptor (rhodopsin family) 258735 7 transmembrane receptor (rhodopsin family) 258736 7 transmembrane receptor (rhodopsin family) 258737 7 transmembrane receptor (rhodopsin family) 258738 7 transmembrane receptor (rhodopsin family) 258739 7 transmembrane receptor (rhodopsin family) 258740 7 transmembrane receptor (rhodopsin family) 258741 7 transmembrane receptor (rhodopsin family) 258742 7 transmembrane receptor (rhodopsin family) 258742 7 transmembrane receptor (rhodopsin family) 258743 7 transmembrane receptor (rhodopsin family) 258744 7 transmembrane receptor (rhodopsin family) 258745 7 transmembrane receptor (rhodopsin family) 258746 7 transmembrane receptor (rhodopsin family) 258747 7 transmembrane receptor (rhodopsin family) 258748 7 transmembrane receptor (rhodopsin family) 258749 7 transmembrane receptor (rhodopsin family) 258750 7 transmembrane receptor (rhodopsin family) 258751 7 transmembrane receptor (rhodopsin family) 258752 7 transmembrane receptor (rhodopsin family) 258753 7 transmembrane receptor (rhodopsin family) 258754 7 transmembrane receptor (rhodopsin family) 258755 7 transmembrane receptor (rhodopsin family) 258755 7 transmembrane receptor (rhodopsin family) 258756 7 transmembrane receptor (rhodopsin family) 258757 7 transmembrane receptor (rhodopsin family) 258758 7 transmembrane receptor (rhodopsin family) 258759 7 transmembrane receptor (rhodopsin family) 258760 7 transmembrane receptor (rhodopsin family) 258762 7 transmembrane receptor (rhodopsin family) 258763 7 transmembrane receptor (rhodopsin family) 258764 7 transmembrane receptor (rhodopsin family) 258765 7 transmembrane receptor (rhodopsin family) 258766 7 transmembrane receptor (rhodopsin family) 258767 7 transmembrane receptor (rhodopsin family) 258768 7 transmembrane receptor (rhodopsin family) 258769 7 transmembrane receptor (rhodopsin family) 258770 7 transmembrane receptor (rhodopsin family) 258771 7 transmembrane receptor (rhodopsin family) 258772 7 transmembrane receptor (rhodopsin family) 258773 7 transmembrane receptor (rhodopsin family) 258774 7 transmembrane receptor (rhodopsin family) 258775 7 transmembrane receptor (rhodopsin family) 258776 7 transmembrane receptor (rhodopsin family) 258777 7 transmembrane receptor (rhodopsin family) 258778 7 transmembrane receptor (rhodopsin family) 258780 7 transmembrane receptor (rhodopsin family) 258781 7 transmembrane receptor (rhodopsin family) 258782 7 transmembrane receptor (rhodopsin family) 258783 7 transmembrane receptor (rhodopsin family) 258784 7 transmembrane receptor (rhodopsin family) 258785 7 transmembrane receptor (rhodopsin family) 258786 7 transmembrane receptor (rhodopsin family) 258787 7 transmembrane receptor (rhodopsin family) 258788 7 transmembrane receptor (rhodopsin family) 258789 7 transmembrane receptor (rhodopsin family) 258790 7 transmembrane receptor (rhodopsin family) 258791 7 transmembrane receptor (rhodopsin family) 258792 7 transmembrane receptor (rhodopsin family) 258793 7 transmembrane receptor (rhodopsin family) 258794 7 transmembrane receptor (rhodopsin family) 258797 7 transmembrane receptor (rhodopsin family) 258798 7 transmembrane receptor (rhodopsin family) 258799 7 transmembrane receptor (rhodopsin family) 258800 7 transmembrane receptor (rhodopsin family) 258801 7 transmembrane receptor (rhodopsin family) 258802 7 transmembrane receptor (rhodopsin family) 258803 7 transmembrane receptor (rhodopsin family) 258804 7 transmembrane receptor (rhodopsin family) 258805 7 transmembrane receptor (rhodopsin family) 258806 7 transmembrane receptor (rhodopsin family) 258807 7 transmembrane receptor (rhodopsin family) 258808 7 transmembrane receptor (rhodopsin family) 258809 7 transmembrane receptor (rhodopsin family) 258810 7 transmembrane receptor (rhodopsin family) 258811 7 transmembrane receptor (rhodopsin family) 258812 7 transmembrane receptor (rhodopsin family) 258814 7 transmembrane receptor (rhodopsin family) 258815 7 transmembrane receptor (rhodopsin family) 258816 7 transmembrane receptor (rhodopsin family) 258817 7 transmembrane receptor (rhodopsin family) 258818 7 transmembrane receptor (rhodopsin family) 258819 7 transmembrane receptor (rhodopsin family) 258820 7 transmembrane receptor (rhodopsin family) 258821 7 transmembrane receptor (rhodopsin family) 258822 7 transmembrane receptor (rhodopsin family) 258823 7 transmembrane receptor (rhodopsin family) 258824 7 transmembrane receptor (rhodopsin family) 258825 7 transmembrane receptor (rhodopsin family) 258826 7 transmembrane receptor (rhodopsin family) 258828 7 transmembrane receptor (rhodopsin family) 258829 7 transmembrane receptor (rhodopsin family) 258830 7 transmembrane receptor (rhodopsin family) 258831 7 transmembrane receptor (rhodopsin family) 258832 7 transmembrane receptor (rhodopsin family) 258833 7 transmembrane receptor (rhodopsin family) 258834 7 transmembrane receptor (rhodopsin family) 258835 7 transmembrane receptor (rhodopsin family) 258836 7 transmembrane receptor (rhodopsin family) 258837 7 transmembrane receptor (rhodopsin family) 258838 7 transmembrane receptor (rhodopsin family) 258840 7 transmembrane receptor (rhodopsin family) 258841 7 transmembrane receptor (rhodopsin family) 258842 7 transmembrane receptor (rhodopsin family) 258843 7 transmembrane receptor (rhodopsin family) 258844 7 transmembrane receptor (rhodopsin family) 258845 7 transmembrane receptor (rhodopsin family) 258846 7 transmembrane receptor (rhodopsin family) 258850 7 transmembrane receptor (rhodopsin family) 258851 7 transmembrane receptor (rhodopsin family) 258852 7 transmembrane receptor (rhodopsin family) 258853 7 transmembrane receptor (rhodopsin family) 258854 7 transmembrane receptor (rhodopsin family) 258855 7 transmembrane receptor (rhodopsin family) 258856 7 transmembrane receptor (rhodopsin family) 258857 7 transmembrane receptor (rhodopsin family) 258859 7 transmembrane receptor (rhodopsin family) 258861 7 transmembrane receptor (rhodopsin family) 258862 7 transmembrane receptor (rhodopsin family) 258863 7 transmembrane receptor (rhodopsin family) 258865 7 transmembrane receptor (rhodopsin family) 258866 7 transmembrane receptor (rhodopsin family) 258867 7 transmembrane receptor (rhodopsin family) 258868 7 transmembrane receptor (rhodopsin family) 258871 7 transmembrane receptor (rhodopsin family) 258872 7 transmembrane receptor (rhodopsin family) 258873 7 transmembrane receptor (rhodopsin family) 258874 7 transmembrane receptor (rhodopsin family) 258875 7 transmembrane receptor (rhodopsin family) 258877 7 transmembrane receptor (rhodopsin family) 258879 7 transmembrane receptor (rhodopsin family) 258880 7 transmembrane receptor (rhodopsin family) 258881 7 transmembrane receptor (rhodopsin family) 258882 7 transmembrane receptor (rhodopsin family) 258883 7 transmembrane receptor (rhodopsin family) 258886 7 transmembrane receptor (rhodopsin family) 258887 7 transmembrane receptor (rhodopsin family) 258888 7 transmembrane receptor (rhodopsin family) 258889 7 transmembrane receptor (rhodopsin family) 258890 7 transmembrane receptor (rhodopsin family) 258891 7 transmembrane receptor (rhodopsin family) 258892 7 transmembrane receptor (rhodopsin family) 258893 7 transmembrane receptor (rhodopsin family) 258894 7 transmembrane receptor (rhodopsin family) 258895 7 transmembrane receptor (rhodopsin family) 258896 7 transmembrane receptor (rhodopsin family) 258897 7 transmembrane receptor (rhodopsin family) 258898 7 transmembrane receptor (rhodopsin family) 258899 7 transmembrane receptor (rhodopsin family) 258900 7 transmembrane receptor (rhodopsin family) 258901 7 transmembrane receptor (rhodopsin family) 258902 7 transmembrane receptor (rhodopsin family) 258903 7 transmembrane receptor (rhodopsin family) 258904 7 transmembrane receptor (rhodopsin family) 258905 7 transmembrane receptor (rhodopsin family) 258907 7 transmembrane receptor (rhodopsin family) 258908 7 transmembrane receptor (rhodopsin family) 258910 7 transmembrane receptor (rhodopsin family) 258912 7 transmembrane receptor (rhodopsin family) 258913 7 transmembrane receptor (rhodopsin family) 258914 7 transmembrane receptor (rhodopsin family) 258915 7 transmembrane receptor (rhodopsin family) 258917 7 transmembrane receptor (rhodopsin family) 258918 7 transmembrane receptor (rhodopsin family) 258919 7 transmembrane receptor (rhodopsin family) 258920 7 transmembrane receptor (rhodopsin family) 258921 7 transmembrane receptor (rhodopsin family) 258922 7 transmembrane receptor (rhodopsin family) 258923 7 transmembrane receptor (rhodopsin family) 258924 7 transmembrane receptor (rhodopsin family) 258925 7 transmembrane receptor (rhodopsin family) 258927 7 transmembrane receptor (rhodopsin family) 258928 7 transmembrane receptor (rhodopsin family) 258929 7 transmembrane receptor (rhodopsin family) 258930 7 transmembrane receptor (rhodopsin family) 258931 7 transmembrane receptor (rhodopsin family) 258932 7 transmembrane receptor (rhodopsin family) 258933 7 transmembrane receptor (rhodopsin family) 258934 7 transmembrane receptor (rhodopsin family) 258935 7 transmembrane receptor (rhodopsin family) 258937 7 transmembrane receptor (rhodopsin family) 258938 7 transmembrane receptor (rhodopsin family) 258939 7 transmembrane receptor (rhodopsin family) 258940 7 transmembrane receptor (rhodopsin family) 258941 7 transmembrane receptor (rhodopsin family) 258942 7 transmembrane receptor (rhodopsin family) 258943 7 transmembrane receptor (rhodopsin family) 258944 7 transmembrane receptor (rhodopsin family) 258945 7 transmembrane receptor (rhodopsin family) 258946 7 transmembrane receptor (rhodopsin family) 258947 7 transmembrane receptor (rhodopsin family) 258949 7 transmembrane receptor (rhodopsin family) 258950 7 transmembrane receptor (rhodopsin family) 258951 7 transmembrane receptor (rhodopsin family) 258952 7 transmembrane receptor (rhodopsin family) 258953 7 transmembrane receptor (rhodopsin family) 258954 7 transmembrane receptor (rhodopsin family) 258955 7 transmembrane receptor (rhodopsin family) 258956 7 transmembrane receptor (rhodopsin family) 258959 7 transmembrane receptor (rhodopsin family) 258960 7 transmembrane receptor (rhodopsin family) 258961 7 transmembrane receptor (rhodopsin family) 258962 7 transmembrane receptor (rhodopsin family) 258963 7 transmembrane receptor (rhodopsin family) 258964 7 transmembrane receptor (rhodopsin family) 258967 7 transmembrane receptor (rhodopsin family) 258968 7 transmembrane receptor (rhodopsin family) 258969 7 transmembrane receptor (rhodopsin family) 258970 7 transmembrane receptor (rhodopsin family) 258971 7 transmembrane receptor (rhodopsin family) 258972 7 transmembrane receptor (rhodopsin family) 258973 7 transmembrane receptor (rhodopsin family) 258974 7 transmembrane receptor (rhodopsin family) 258975 7 transmembrane receptor (rhodopsin family) 258976 7 transmembrane receptor (rhodopsin family) 258977 7 transmembrane receptor (rhodopsin family) 258979 7 transmembrane receptor (rhodopsin family) 258980 7 transmembrane receptor (rhodopsin family) 258981 7 transmembrane receptor (rhodopsin family) 258982 7 transmembrane receptor (rhodopsin family) 258983 7 transmembrane receptor (rhodopsin family) 258985 7 transmembrane receptor (rhodopsin family) 258986 7 transmembrane receptor (rhodopsin family) 258987 7 transmembrane receptor (rhodopsin family) 258988 7 transmembrane receptor (rhodopsin family) 258989 7 transmembrane receptor (rhodopsin family) 258990 7 transmembrane receptor (rhodopsin family) 258991 7 transmembrane receptor (rhodopsin family) 258992 7 transmembrane receptor (rhodopsin family) 258993 7 transmembrane receptor (rhodopsin family) 258995 7 transmembrane receptor (rhodopsin family) 258996 7 transmembrane receptor (rhodopsin family) 258997 7 transmembrane receptor (rhodopsin family) 258998 7 transmembrane receptor (rhodopsin family) 258999 7 transmembrane receptor (rhodopsin family) 259000 7 transmembrane receptor (rhodopsin family) 259001 7 transmembrane receptor (rhodopsin family) 259002 7 transmembrane receptor (rhodopsin family) 259003 7 transmembrane receptor (rhodopsin family) 259004 7 transmembrane receptor (rhodopsin family) 259005 7 transmembrane receptor (rhodopsin family) 259006 7 transmembrane receptor (rhodopsin family) 259007 7 transmembrane receptor (rhodopsin family) 259008 7 transmembrane receptor (rhodopsin family) 259009 7 transmembrane receptor (rhodopsin family) 259010 7 transmembrane receptor (rhodopsin family) 259011 7 transmembrane receptor (rhodopsin family) 259012 7 transmembrane receptor (rhodopsin family) 259013 7 transmembrane receptor (rhodopsin family) 259014 7 transmembrane receptor (rhodopsin family) 259015 7 transmembrane receptor (rhodopsin family) 259016 7 transmembrane receptor (rhodopsin family) 259017 7 transmembrane receptor (rhodopsin family) 259018 7 transmembrane receptor (rhodopsin family) 259019 7 transmembrane receptor (rhodopsin family) 259020 7 transmembrane receptor (rhodopsin family) 259021 7 transmembrane receptor (rhodopsin family) 259022 7 transmembrane receptor (rhodopsin family) 259023 7 transmembrane receptor (rhodopsin family) 259024 7 transmembrane receptor (rhodopsin family) 259025 7 transmembrane receptor (rhodopsin family) 259026 7 transmembrane receptor (rhodopsin family) 259027 7 transmembrane receptor (rhodopsin family) 259030 7 transmembrane receptor (rhodopsin family) 259031 7 transmembrane receptor (rhodopsin family) 259032 7 transmembrane receptor (rhodopsin family) 259033 7 transmembrane receptor (rhodopsin family) 259034 7 transmembrane receptor (rhodopsin family) 259035 7 transmembrane receptor (rhodopsin family) 259036 7 transmembrane receptor (rhodopsin family) 259037 7 transmembrane receptor (rhodopsin family) 259038 7 transmembrane receptor (rhodopsin family) 259039 7 transmembrane receptor (rhodopsin family) 259040 7 transmembrane receptor (rhodopsin family) 259041 7 transmembrane receptor (rhodopsin family) 259042 7 transmembrane receptor (rhodopsin family) 259042 7 transmembrane receptor (rhodopsin family) 259044 7 transmembrane receptor (rhodopsin family) 259045 7 transmembrane receptor (rhodopsin family) 259046 7 transmembrane receptor (rhodopsin family) 259047 7 transmembrane receptor (rhodopsin family) 259048 7 transmembrane receptor (rhodopsin family) 259049 7 transmembrane receptor (rhodopsin family) 259050 7 transmembrane receptor (rhodopsin family) 259051 7 transmembrane receptor (rhodopsin family) 259052 7 transmembrane receptor (rhodopsin family) 259053 7 transmembrane receptor (rhodopsin family) 259054 7 transmembrane receptor (rhodopsin family) 259055 7 transmembrane receptor (rhodopsin family) 259056 7 transmembrane receptor (rhodopsin family) 259057 7 transmembrane receptor (rhodopsin family) 259058 7 transmembrane receptor (rhodopsin family) 259061 7 transmembrane receptor (rhodopsin family) 259062 7 transmembrane receptor (rhodopsin family) 259063 7 transmembrane receptor (rhodopsin family) 259064 7 transmembrane receptor (rhodopsin family) 259066 7 transmembrane receptor (rhodopsin family) 259067 7 transmembrane receptor (rhodopsin family) 259069 7 transmembrane receptor (rhodopsin family) 259070 7 transmembrane receptor (rhodopsin family) 259071 7 transmembrane receptor (rhodopsin family) 259072 7 transmembrane receptor (rhodopsin family) 259073 7 transmembrane receptor (rhodopsin family) 259075 7 transmembrane receptor (rhodopsin family) 259078 7 transmembrane receptor (rhodopsin family) 259080 7 transmembrane receptor (rhodopsin family) 259081 7 transmembrane receptor (rhodopsin family) 259082 7 transmembrane receptor (rhodopsin family) 259083 7 transmembrane receptor (rhodopsin family) 259084 7 transmembrane receptor (rhodopsin family) 259085 7 transmembrane receptor (rhodopsin family) 259086 7 transmembrane receptor (rhodopsin family) 259087 7 transmembrane receptor (rhodopsin family) 259088 7 transmembrane receptor (rhodopsin family) 259089 7 transmembrane receptor (rhodopsin family) 259090 7 transmembrane receptor (rhodopsin family) 259091 7 transmembrane receptor (rhodopsin family) 259092 7 transmembrane receptor (rhodopsin family) 259093 7 transmembrane receptor (rhodopsin family) 259094 7 transmembrane receptor (rhodopsin family) 259095 7 transmembrane receptor (rhodopsin family) 259096 7 transmembrane receptor (rhodopsin family) 259097 7 transmembrane receptor (rhodopsin family) 259098 7 transmembrane receptor (rhodopsin family) 259099 7 transmembrane receptor (rhodopsin family) 259100 7 transmembrane receptor (rhodopsin family) 259101 7 transmembrane receptor (rhodopsin family) 259102 7 transmembrane receptor (rhodopsin family) 259103 7 transmembrane receptor (rhodopsin family) 259104 7 transmembrane receptor (rhodopsin family) 259105 7 transmembrane receptor (rhodopsin family) 259106 7 transmembrane receptor (rhodopsin family) 259107 7 transmembrane receptor (rhodopsin family) 259108 7 transmembrane receptor (rhodopsin family) 259109 7 transmembrane receptor (rhodopsin family) 259110 7 transmembrane receptor (rhodopsin family) 259111 7 transmembrane receptor (rhodopsin family) 259112 7 transmembrane receptor (rhodopsin family) 259113 7 transmembrane receptor (rhodopsin family) 259114 7 transmembrane receptor (rhodopsin family) 259115 7 transmembrane receptor (rhodopsin family) 259116 7 transmembrane receptor (rhodopsin family) 259117 7 transmembrane receptor (rhodopsin family) 259118 7 transmembrane receptor (rhodopsin family) 259119 7 transmembrane receptor (rhodopsin family) 259122 7 transmembrane receptor (rhodopsin family) 259123 7 transmembrane receptor (rhodopsin family) 259124 7 transmembrane receptor (rhodopsin family) 259125 7 transmembrane receptor (rhodopsin family) 259126 7 transmembrane receptor (rhodopsin family) 259172 CUB domain 259172 Low-density lipoprotein receptor domain class A 259173 Vacuolar sorting protein 9 (VPS9) domain 259173 Vacuolar sorting protein 9 (VPS9) domain 259217 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 259221 Oxysterol-binding protein 259229 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 259232 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 259233 Glutathione peroxidase 259235 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 259241 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 259241 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 259244 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 259245 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 259246 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 259247 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 259249 7 transmembrane receptor (rhodopsin family) 259271 HIT family 259273 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 259273 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 259279 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 259307 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 259307 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T 260293 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 260298 Ets-domain 260302 VHS domain. Domain present in VPS-27, Hrs and STAM 260302 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 260302 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 260319 7 transmembrane receptor (rhodopsin family) 260321 TPR Domain 260321 FKBP-type peptidyl-prolyl cis-trans isomerase 260323 PX domain. PX domains bind to phosphoinositides 260326 7 transmembrane receptor (Secretin family) 260326 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 260430 7 transmembrane receptor (metabotropic glutamate family) 260434 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 261726 TIP41-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 interacts with TAP42 and negatively regulates the TOR signaling pathway 261737 Tricarboxylate carrier 263803 Hr1 repeat 263803 Protein kinase domain 263803 Protein kinase C terminal domain 266459 Eukaryotic initiation factor 1A 266600 'Cold-shock' DNA-binding domain 266601 MAS20 protein import receptor 266602 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand 266603 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd 266604 Lipoxygenase 266604 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 266607 7 transmembrane receptor (rhodopsin family) 266608 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 266610 Death domain 266610 Death effector domain 266612 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 266629 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 266632 Death domain 266632 Protein kinase domain 266632 Protein kinase domain 266632 Protein kinase domain 266668 Uncharacterized protein family UPF0005 266674 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 266675 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 266680 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 266682 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 266684 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 266685 UDP-glucoronosyl and UDP-glucosyl transferase 266686 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 266688 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 266689 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 266690 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 266690 Oxidoreductase FAD-binding domain 266690 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 266690 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 266692 C2 domain 266692 C2 domain 266707 Connexin 266707 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 266709 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 266715 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 266717 Intermediate filament protein 266727 MAM domain. An extracellular domain found in many receptors 266729 Phosphotyrosine interaction domain (PTB/PID) 266735 7 transmembrane receptor (Secretin family) 266735 7 transmembrane receptor (Secretin family) 266735 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 266735 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 266738 Homeobox domain 266740 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 266758 Signal peptidase I 266759 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom 266760 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 266762 7 transmembrane receptor (rhodopsin family) 266763 Sodium:dicarboxylate symporter family 266765 Fibronectin type III domain 266765 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 266769 7 transmembrane receptor (rhodopsin family) 266771 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 266772 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 266773 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 266778 7 transmembrane receptor (metabotropic glutamate family) 266780 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 266781 PX domain. PX domains bind to phosphoinositides 266782 Somatotropin hormone family 266804 Somatotropin hormone family 266807 Animal haem peroxidase 266807 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou 266812 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 266813 Homeobox domain 266815 Class I Histocompatibility antigen, domains alpha 1 and 2 266815 Class I Histocompatibility antigen, domains alpha 1 and 2 266975 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 266977 7 transmembrane receptor (Secretin family) 266998 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 267019 Ribosomal protein S8 268278 Ribosomal protein S5, N-terminal domain 268278 Ribosomal protein S5, C-terminal domain 268281 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 268281 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 268281 Zinc finger, C3HC4 type (RING finger) 268281 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 268281 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 268289 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 268294 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 268294 Adenylate kinase 268294 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 268294 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 268297 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 268299 Helix-loop-helix DNA-binding domain 268299 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 268299 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 268301 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 268308 Ribosomal protein L19e 268311 Ribosomal L15 268312 ENV polyprotein (coat polyprotein) 268319 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 268320 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 268321 Nucleoside diphosphate kinase 268326 NB-ARC domain 268326 WD domain, G-beta repeat 268326 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 268329 Laminin N-terminal (Domain VI) 268329 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 268329 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 268330 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 268330 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 268336 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 268337 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 268340 PhoH-like protein. PhoH is a cytoplasmic protein and predicted ATPase that is induced by phosphate starvation 268345 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 268345 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 268347 Ribosomal protein S19 268353 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 268353 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 268358 Ribosomal protein S2 268360 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 268361 7 transmembrane receptor (rhodopsin family) 268363 7 transmembrane receptor (rhodopsin family) 268373 Cyclophilin type peptidyl-prolyl cis-trans isomerase 268379 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 268385 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 268385 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 268385 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 268395 Methylpurine-DNA glycosylase (MPG). Methylpurine-DNA glycosylase is a base excision-repair protein. It is responsible for the hydrolysis of the deoxyribose N-glycosidic bond, excising 3-methyladenine and 3-methylguanine from damaged DNA 268396 PX domain. PX domains bind to phosphoinositides 268396 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 268398 Plant PEC family metallothionein 268398 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 268398 Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int 268406 Kinesin motor domain 268413 7 transmembrane receptor (rhodopsin family) 268415 Core histone H2A/H2B/H3/H4 268417 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 268424 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 268426 AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt 268428 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 268428 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 268429 Ribosomal protein L13e 268435 SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins 268439 7 transmembrane receptor (rhodopsin family) 268444 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 268448 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 268448 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 268449 Ribosomal protein L23 268449 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 268452 Cyclin-dependent kinase 5 activator protein 268460 Animal haem peroxidase 268463 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 268467 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 268470 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack 268474 PH domain. PH stands for pleckstrin homology 268477 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 268480 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 268481 Intermediate filament protein 268482 Intermediate filament protein 268482 Intermediate filament protein 268484 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 268493 TPR Domain 268495 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 268495 Integrins, beta chain. Sequences cut off at repeats due to overlap with EGF 268497 C2 domain 268497 Protein kinase domain 268497 Protein kinase C terminal domain 268497 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 268504 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 268512 Sulfate transporter family. Mutations may lead to several human diseases 268512 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 268515 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 268520 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 268522 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 268524 Ribosomal protein S19 268529 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 268539 Ribosomal protein S5, N-terminal domain 268542 BED zinc finger 268542 pfam02892, zf-BED, BED zinc finger 268543 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 268543 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 268544 Ribosomal protein S26e 268545 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 268545 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 268550 Ribosomal protein L13e 268552 Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 268552 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 268553 Cyclophilin type peptidyl-prolyl cis-trans isomerase 268555 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 268555 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 268557 Vinculin family 268564 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 268564 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 268565 Uncharacterised protein family (UPF0041) 268573 Ribosomal protein L35Ae 268576 Ribosomal protein L21e 268584 NAC domain 268589 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 268589 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 268591 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 268593 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 268602 HMG (high mobility group) box 268604 Protein kinase domain 268604 Protein kinase C terminal domain 268604 PH domain. PH stands for pleckstrin homology 268612 L1 transposable element 268612 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 268614 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 268622 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 268623 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 268625 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 268633 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 268644 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 268650 Ribosomal S17 268656 Aminotransferase class I and II 268657 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 268663 Cadherin domain 268665 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 268667 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 268676 Ribosomal protein L21e 268677 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 268681 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 268681 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 268686 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 268687 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 268695 Ribosomal protein L15 268700 Ribosomal S3Ae family 268706 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 268706 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 268708 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 268708 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 268713 PH domain. PH stands for pleckstrin homology 268714 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in 268728 Sugar (and other) transporter 268728 LacY proton/sugar symporter. This family is closely related to the sugar transporter family 268729 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 268741 HMG (high mobility group) box 268742 Amino acid permease 268742 Tryptophan/tyrosine permease family 268746 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 268747 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 268752 WD domain, G-beta repeat 268752 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 268756 FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD bi 268757 Ribosomal protein S5, C-terminal domain 268762 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 268766 HMG (high mobility group) box 268767 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 268772 Glypican 268772 Uncharacterised protein (DUF314) 268776 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 268777 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 268777 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 268780 Fibronectin type III domain 268780 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 268780 Fibronectin type III domain 268780 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 268780 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 268782 Aminotransferase class-III 268790 PH domain. PH stands for pleckstrin homology 268790 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 268795 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 268798 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 268801 Translation initiation factor SUI1 268802 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 268807 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 268807 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet 268807 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 268807 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 268818 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 268831 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 268838 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 268843 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 268848 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 268855 Peptidase family C50 268857 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 268857 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 268859 Cullin family 268859 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 268860 Aminotransferase class-III 268869 7 transmembrane receptor (rhodopsin family) 268877 KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG 268879 Helix-loop-helix DNA-binding domain 268880 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 268882 F-box domain 268882 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 268884 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 268884 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 268885 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami 268886 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 268887 Ribosomal protein L6 268890 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 268891 Ribosomal protein S24e 268893 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 268895 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 268898 Ribosomal protein S21e 268899 Actin 268910 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 268921 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 268923 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 268923 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 268923 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 268923 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 268925 7 transmembrane receptor (metabotropic glutamate family) 268925 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 268927 7 transmembrane receptor (metabotropic glutamate family) 268928 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 268932 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 268933 WD domain, G-beta repeat 268935 CUB domain 268939 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 268940 Homeobox domain 268941 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 268941 Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved 268945 Class II histocompatibility antigen, beta domain 268945 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 268955 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 268958 Calpain family cysteine protease 268964 Ribosomal protein L21e 268967 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 268977 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 268977 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 268980 WD domain, G-beta repeat 268981 CBF/Mak21 family 268986 GTP1/OBG family 268986 GTPase of unknown function 268988 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 268992 HMG (high mobility group) box 269002 Ribosomal protein L21e 269010 L1 transposable element 269010 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 269019 Protein kinase domain 269035 Proteasome A-type and B-type 269036 Myosin head (motor domain) 269038 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 269043 Ribosomal protein S8e 269044 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 269045 7 transmembrane receptor (rhodopsin family) 269047 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 269047 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 269049 Calpain family cysteine protease 269049 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by 269053 7 transmembrane receptor (rhodopsin family) 269063 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 269064 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 269064 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 269066 7 transmembrane receptor (rhodopsin family) 269069 7 transmembrane receptor (rhodopsin family) 269070 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 269075 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 269075 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 269079 Occludin/ELL family 269079 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 269079 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 269083 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 269085 Ribosomal L15 269086 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 269088 Homeobox domain 269091 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 269100 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 269103 Inward rectifier potassium channel 269109 Dienelactone hydrolase family 269109 Prolyl oligopeptidase family 269109 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 269109 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 269114 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic 269116 Fibronectin type III domain 269116 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 269121 Ribosomal protein S12 269132 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va 269132 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of var 269137 Sushi domain (SCR repeat) 269140 SH2 domain 269146 Pentaxin family. Pentaxins are also known as pentraxins 269155 Ribosomal protein L21e 269156 Ribosomal protein L21e 269163 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 269166 Ribosomal protein L15 269173 Thrombospondin N-terminal -like domain 269173 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 269190 Ribosomal L15 269191 Ribosomal protein S24e 269193 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 269193 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 269198 Beige/BEACH domain 269200 Ribosomal L18ae protein family 269204 Tau and MAP protein, tubulin-binding repeat 269207 Ribosomal L15 269211 ENV polyprotein (coat polyprotein) 269211 ENV polyprotein (coat polyprotein) 269213 L1 transposable element 269213 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 269224 Protein kinase domain 269224 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 269227 Phosphatidylinositol transfer protein 269229 Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold 269229 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 269229 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 269230 Intermediate filament protein 269232 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 269242 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 269242 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 269243 Protein kinase domain 269261 Ribosomal protein L11, RNA binding domain 269261 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 269265 Ribosomal S3Ae family 269267 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 269268 ENV polyprotein (coat polyprotein) 269275 Protein kinase domain 269275 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t 269278 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 269280 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 269282 Protein kinase domain 269283 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 269283 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 269283 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 269285 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 269289 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 269297 7 transmembrane receptor (rhodopsin family) 269300 7 transmembrane receptor (rhodopsin family) 269308 7 transmembrane receptor (rhodopsin family) 269309 7 transmembrane receptor (rhodopsin family) 269310 7 transmembrane receptor (rhodopsin family) 269311 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 269311 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 269312 7 transmembrane receptor (rhodopsin family) 269316 Fibronectin type III domain 269316 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 269319 Enolase, N-terminal domain 269319 Enolase, C-terminal TIM barrel domain 269325 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 269325 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 269338 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 269338 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 269338 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 269346 Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin 269353 Cyclophilin type peptidyl-prolyl cis-trans isomerase 269354 Transglutaminase family 269354 Transglutaminase family, C-terminal ig like domain 269354 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 269356 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 269361 HMG (high mobility group) box 269363 Ribosomal L15 269364 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269365 Ribosomal protein S19e 269366 Uncharacterized ACR, COG1579 269366 Intermediate filament protein 269366 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 269367 Ribosomal protein L35Ae 269369 Cyclophilin type peptidyl-prolyl cis-trans isomerase 269369 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 269371 Ribosomal protein L21e 269373 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 269378 S-adenosyl-L-homocysteine hydrolase 269378 S-adenosyl-L-homocysteine hydrolase 269378 Alanine dehydrogenase/pyridine nucleotide transhydrogenase. This family now also contains the lysine 2-oxoglutarate reductases 269378 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 269380 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 269380 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 269380 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family 269380 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 269384 LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 269384 LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 269384 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 269406 Protein kinase domain 269406 ENV polyprotein (coat polyprotein) 269410 Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 269410 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 269413 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 269414 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269415 Myosin head (motor domain) 269415 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 269415 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 269416 Ribosomal protein L15 269425 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 269427 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 269432 Ribosomal protein S17 269433 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269434 ENV polyprotein (coat polyprotein) 269435 Ribosomal protein S12 269444 Sodium / potassium ATPase beta chain 269445 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 269452 Extracellular link domain 269452 Lectin C-type domain. This family includes both long and short form C-type 269452 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 269452 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 269466 Thi4 family. This family includes a putative thiamine biosynthetic enzyme 269466 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 269466 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 269466 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 269470 Dip2/Utp12 Family. This domain is found at the C-terminus of proteins containing WD40 repeats. These proteins are part of the U3 ribonucleoprotein the yeast protein is called Utp12 or DIP2 269470 WD domain, G-beta repeat 269473 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 269473 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 269478 Ribosomal protein L13e 269485 Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protein 269487 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str 269497 SH2 domain 269497 PH domain. PH stands for pleckstrin homology 269501 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 269501 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 269504 7 transmembrane receptor (Secretin family) 269504 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 269508 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 269508 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269511 HMG (high mobility group) box 269512 Glutamine amidotransferase class-I 269514 F-box domain 269523 Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o 269523 Bacterial dnaA protein 269523 Sigma-54 interaction domain 269523 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 269523 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 269523 Parvovirus non-structural protein NS1. This family also contains the NS2 protein. Parvoviruses encode two non-structural proteins, NS1 and NS2. The mRNA for NS2 contains the coding sequence for the first 87 amino acids of NS1, then by an alternative splic 269523 Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist of 269523 Bacterial dnaA protein 269523 Sigma-54 interaction domain 269523 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 269523 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 269523 Parvovirus non-structural protein NS1. This family also contains the NS2 protein. Parvoviruses encode two non-structural proteins, NS1 and NS2. The mRNA for NS2 contains the coding sequence for the first 87 amino acids of NS1, then by an alternative splic 269523 Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist of 269527 L1 transposable element 269527 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 269529 F-box domain 269533 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 269534 Ribosomal S17 269538 Ribosomal L29e protein family 269540 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 269541 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 269542 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 269548 Ribosomal protein S6e 269552 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 269558 Respiratory-chain NADH dehydrogenase, 49 Kd subunit 269569 Cyclophilin type peptidyl-prolyl cis-trans isomerase 269579 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 269585 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 269587 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 269589 C2 domain 269589 C2 domain 269590 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 269591 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269593 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 269594 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 269595 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 269597 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 269604 7 transmembrane receptor (Secretin family) 269604 7 transmembrane receptor (Secretin family) 269604 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 269608 PH domain. PH stands for pleckstrin homology 269608 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 269611 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 269611 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269611 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 269611 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 269614 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor 269615 PH domain. PH stands for pleckstrin homology 269615 PH domain. PH stands for pleckstrin homology 269615 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 269620 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269622 Ribosomal family S4e 269625 Intermediate filament protein 269628 S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 269628 S-adenosylmethionine synthetase, C-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 269635 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 269636 Homeobox domain 269642 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 269650 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269664 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 269666 L1 transposable element 269666 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 269674 Ribosomal protein L6 269691 14-3-3 protein 269697 Homeobox domain 269697 LIM domain. This family represents two copies of the LIM structural domain 269700 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 269709 Protein kinase domain 269713 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 269713 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 269713 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 269713 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 269715 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 269715 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 269715 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 269719 Ribosomal protein S7e 269725 Elongation factor 1 gamma, conserved domain 269725 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 269732 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 269732 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 269735 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 269740 Homeobox domain 269742 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 269743 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 269744 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 269745 7 transmembrane receptor (rhodopsin family) 269745 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 269750 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 269751 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 269754 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 269754 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 269756 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 269758 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 269758 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269758 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 269758 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 269758 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 269763 Tubulin binding cofactor A 269765 HMG (high mobility group) box 269769 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 269769 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 269771 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 269771 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 269772 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 269773 Protein kinase domain 269783 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 269784 Fibronectin type III domain 269786 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 269792 Core histone H2A/H2B/H3/H4 269798 Phosphoribulokinase / Uridine kinase family 269799 Lectin C-type domain. This family includes both long and short form C-type 269804 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 269804 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 269807 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 269813 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 269813 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 269819 L1 transposable element 269819 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 269823 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. 269830 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 269831 Tetraspanin family 269836 von Willebrand factor type D domain 269836 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 269841 ENV polyprotein (coat polyprotein) 269845 Fructose-bisphosphate aldolase class-I 269845 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 269855 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 269856 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 269857 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 269858 7 transmembrane receptor (rhodopsin family) 269858 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 269859 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 269862 7 transmembrane receptor (rhodopsin family) 269863 ENV polyprotein (coat polyprotein) 269863 7 transmembrane receptor (rhodopsin family) 269863 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 269863 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 269863 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 269863 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 269863 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 269864 WD domain, G-beta repeat 269865 Translation initiation factor SUI1 269870 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 269870 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 269870 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 269870 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 269870 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 269871 Sulfotransferase protein 269873 Ribosomal S3Ae family 269874 Sodium:dicarboxylate symporter family 269877 Ets-domain 269881 Protein kinase domain 269886 PH domain. PH stands for pleckstrin homology 269886 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 269894 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 269894 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 269902 7 transmembrane receptor (metabotropic glutamate family) 269902 7 transmembrane receptor (metabotropic glutamate family) 269905 Trypsin 269906 Trypsin 269907 Trypsin 269913 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 269919 7 transmembrane receptor (rhodopsin family) 269920 ENV polyprotein (coat polyprotein) 269928 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 269932 SH2 domain 269932 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 269939 Guanylate kinase 269939 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 269939 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 269940 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 269941 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 269943 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 269945 Sushi domain (SCR repeat) 269945 Extracellular link domain 269945 Lectin C-type domain. This family includes both long and short form C-type 269948 Translation initiation factor SUI1 269949 Helix-loop-helix DNA-binding domain 269954 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 269957 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b 269959 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 269968 7 transmembrane receptor (rhodopsin family) 269969 7 transmembrane receptor (rhodopsin family) 269971 7 transmembrane receptor (rhodopsin family) 269972 7 transmembrane receptor (rhodopsin family) 269973 7 transmembrane receptor (rhodopsin family) 269974 7 transmembrane receptor (rhodopsin family) 269980 Ubiquitin carboxyl-terminal hydrolase family 2 269981 7 transmembrane receptor (rhodopsin family) 269982 7 transmembrane receptor (rhodopsin family) 269989 KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG 269990 Eukaryotic porin 269994 PMP-22/EMP/MP20/Claudin family 269997 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 270002 metallopeptidase family M24 270004 Fork head domain 270006 Ribosomal protein L24e 270009 7 transmembrane receptor (rhodopsin family) 270014 Protein kinase domain 270017 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 270017 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 270017 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 270017 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 270017 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 270026 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 270026 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 270030 Ribosomal S3Ae family 270033 Ribosomal protein L19e 270038 GTP1/OBG family 270038 GTPase of unknown function 270038 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 270040 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 270046 PMP-22/EMP/MP20/Claudin family 270049 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 270049 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 270055 Sushi domain (SCR repeat) 270055 Extracellular link domain 270055 Lectin C-type domain. This family includes both long and short form C-type 270055 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 270055 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o 270056 3'5'-cyclic nucleotide phosphodiesterase 270057 Fibronectin type III domain 270082 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 270083 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 270089 Carboxylesterase 270093 Ribosomal protein S24e 270096 SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs 270099 Ribosomal S3Ae family 270102 Amino acid permease 270104 Translation initiation factor SUI1 270105 Cadherin domain 270105 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 270106 Ribosomal protein L13e 270106 Ribosomal protein L13e 270109 Competence protein. Members of this family are integral membrane proteins with 6 predicted transmembrane helices. Some members of this family have been shown to be essential for bacterial competence in uptake of extracellular DNA. These proteins may tran 270112 CUB domain 270112 MAM domain. An extracellular domain found in many receptors 270112 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 270119 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 270121 Ribosomal protein L21e 270140 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 270144 7 transmembrane receptor (rhodopsin family) 270145 7 transmembrane receptor (rhodopsin family) 270147 ENV polyprotein (coat polyprotein) 270147 7 transmembrane receptor (rhodopsin family) 270147 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 270148 Ribosomal protein S7e 270149 Zona pellucida-like domain 270149 von Willebrand factor type D domain 270149 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 270151 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma 270155 NNMT/PNMT/TEMT family 270160 ADP-ribosylation factor family 270160 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 270163 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 270164 ADP-ribosylation factor family 270166 Sigma-54 interaction domain 270166 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 270169 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 270174 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 270174 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 270183 DedA family. This family combines the DedA related proteins and YIAN/YGIK family. Members of this family are not functionally characterised. These proteins contain multiple predicted transmembrane regions 270187 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 270190 Fibronectin type III domain 270190 Giardia variant-specific surface protein 270190 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 270192 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 270192 ADP-ribosylation factor family 270192 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 270192 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 270193 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 270198 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 270201 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 270214 Peptidase family C54 270220 Ribosomal protein L23 270230 Core histone H2A/H2B/H3/H4 270231 Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG 270234 Ribosomal protein L23 270236 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 270243 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 270243 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 270251 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 270263 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 270275 NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5 270277 ENV polyprotein (coat polyprotein) 270289 7 transmembrane receptor (rhodopsin family) 270289 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 270309 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 270309 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 270319 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 270343 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 270344 Homeobox domain 270355 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 270358 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 270362 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 270363 UDP-glucoronosyl and UDP-glucosyl transferase 270363 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 270364 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 270366 Hsp90 protein 270366 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 270372 Ribosomal protein L19e 270379 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 270381 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 270384 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 270392 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 270407 Ribosomal protein L21e 270409 Ribosomal protein L15 270417 Intermediate filament protein 270417 Laminin N-terminal (Domain VI) 270417 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 270417 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 270417 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 270417 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 270425 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 270426 7 transmembrane receptor (rhodopsin family) 270426 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 270434 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 270437 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 270448 7 transmembrane receptor (rhodopsin family) 270461 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 270468 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 270470 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 270473 7 transmembrane receptor (rhodopsin family) 270491 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 270498 7 transmembrane receptor (rhodopsin family) 270499 Ribosomal protein L23 270500 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 270502 7 transmembrane receptor (rhodopsin family) 270502 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 270504 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 270504 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 270506 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 270507 7 transmembrane receptor (rhodopsin family) 270529 Protein kinase domain 270535 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 270539 7 transmembrane receptor (rhodopsin family) 270541 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 270566 Ribosomal protein L23 270567 Ribosomal protein L23 270575 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 270580 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 270582 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 270582 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 270584 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 270588 L1 transposable element 270589 L1 transposable element 270589 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 270596 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 270600 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 270602 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 270612 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 270612 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 270617 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 270617 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 270619 Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase 270621 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 270624 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members 270624 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 270625 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 270627 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 270636 ADP-ribosylation factor family 270636 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 270637 Ribosomal protein L11, RNA binding domain 270639 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 270639 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 270639 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 270639 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 270640 Hsp90 protein 270643 ENV polyprotein (coat polyprotein) 270643 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 270643 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 270658 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 270658 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 270660 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 270665 HMG (high mobility group) box 270667 Zinc finger, C3HC4 type (RING finger) 270669 Peptidase family M50 270670 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 270672 Protein kinase domain 270685 Formate--tetrahydrofolate ligase 270734 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 270742 Ribosomal protein L14p/L23e 270749 Trypsin 270757 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 270757 LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 270757 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 270763 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 270764 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 270784 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 270804 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 270822 Actin 270829 Ribosomal protein S2 270984 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 270984 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 270984 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 270986 Ribosomal protein S5, C-terminal domain 270986 Ribosomal protein S5, N-terminal domain 270988 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 270997 Homeobox domain 270999 Actin 271003 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 271005 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 271021 Core histone H2A/H2B/H3/H4 271022 Eukaryotic initiation factor 1A 271041 Ribosomal L29e protein family 271047 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 271048 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 271068 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 271115 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 271124 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 271127 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 271127 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 271133 HMG (high mobility group) box 271142 Ribosomal protein S8 271147 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 271171 Triosephosphate isomerase 271173 Thymidylate synthase 271202 HMG (high mobility group) box 271215 Phosphatidylethanolamine-binding protein 271228 Ribosomal protein L36e 271264 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 271268 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 271278 Growth-Arrest-Specific Protein 2 Domain 271280 WD domain, G-beta repeat 271294 Ribosomal L29e protein family 271297 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 271299 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 271300 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 271305 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 271350 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 271350 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 271363 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 271364 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 271374 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 271374 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 271375 Cation transport protein. This family consists of various cation transport proteins (Trk) and V-type sodium ATP synthase subunit J or translocating ATPase J EC:3.6.1.34. These proteins are involved in active sodium up-take utilizing ATP in the process. Tr 271395 Kinesin motor domain 271409 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 271415 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 271424 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 271427 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 271427 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 271444 Class I Histocompatibility antigen, domains alpha 1 and 2 271444 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 271451 LIM domain. This family represents two copies of the LIM structural domain 271457 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 271463 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 271489 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 271489 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 271489 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 271490 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 271490 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 271490 Glucuronate isomerase. This is a family of Glucuronate isomerases also known as D-glucuronate isomerase, uronic isomerase, uronate isomerase, or uronic acid isomerase, EC:5.3.1.12. This enzyme catalyses the reactions: D-glucuronate <=> D-fructuronate and 271501 Prp18 domain. The splicing factor Prp18 is required for the second step of pre-mRNA splicing. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known. This fragment is fully active in yeast splicing in vitro and includes the seque 271505 'Cold-shock' DNA-binding domain 271508 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 271513 Ribosomal protein S28e 271519 DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after 271521 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 271529 Proteasome A-type and B-type 271591 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 271591 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 271634 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 271658 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 271682 Ribosomal protein L6 271697 Protein kinase domain 271707 Protein kinase domain 271708 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 271709 Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain 271786 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 271844 C2 domain 271844 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 271942 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 271944 C2 domain 271945 Ribosomal protein S5, C-terminal domain 271970 Sulfatase 271976 Eukaryotic ribosomal protein L18 271978 Ribosomal protein S28e 272002 Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars 272009 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 272031 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 272129 Zinc finger, C3HC4 type (RING finger) 272138 ENV polyprotein (coat polyprotein) 272143 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 272144 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 272149 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 272155 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 272158 DNA polymerase family A 272266 Clathrin adaptor complex small chain 272284 Protein kinase domain 272294 Peptidase family C25 272294 Ribosomal protein S8 272311 Lectin C-type domain. This family includes both long and short form C-type 272322 Helix-loop-helix DNA-binding domain 272331 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 272339 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved 272341 Trypsin 272350 Galactoside-binding lectin 272355 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 272355 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 272357 7 transmembrane receptor (rhodopsin family) 272357 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 272378 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 272381 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 272381 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 272382 Ets-domain 272391 HMG (high mobility group) box 272407 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 272411 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 272417 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 272418 7 transmembrane receptor (rhodopsin family) 272422 HMG (high mobility group) box 272428 AMP-binding enzyme 272437 Sulfotransferase protein 272439 Virulence factor MVIN. The MVIN protein is a putative integral membrane protein. The function is unknown 272443 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 272443 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 272443 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 272462 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 272511 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 272518 Ribosomal L39 protein 272551 Domain of unknown function (DUF392). A eukaryotic specific domain of undetermined function.` 272557 Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 272557 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 272603 Ribosomal protein L21e 272633 Mitochondrial carrier protein 272636 C2 domain 272636 C2 domain 272643 Trypsin 272661 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 272667 Protein kinase domain 272667 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 272668 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 272671 Eukaryotic initiation factor 1A 272673 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 272680 Protein kinase domain 272681 Ribosomal protein S17 272683 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 272684 HMG (high mobility group) box 272691 S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea 272692 Eukaryotic initiation factor 1A 272699 Eukaryotic initiation factor 1A 272701 Phosphatidylserine decarboxylase. This is a family of phosphatidylserine decarboxylases, EC:4.1.1.65. These enzymes catalyse the reaction: Phosphatidyl-L-serine <=> phosphatidylethanolamine + CO2. Phosphatidylserine decarboxylase plays a central role in t 272723 7 transmembrane receptor (rhodopsin family) 272740 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 272771 ENV polyprotein (coat polyprotein) 272771 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 272790 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 272811 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 276720 Inward rectifier potassium channel 276731 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 276740 7 transmembrane receptor (rhodopsin family) 276742 7 transmembrane receptor (rhodopsin family) 276743 7 transmembrane receptor (rhodopsin family) 276756 Hsp90 protein 276756 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 276760 Ribosomal protein L14p/L23e 276764 7 transmembrane receptor (rhodopsin family) 276770 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 276792 Thymosin beta-4 family 276803 Actin 276804 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 276829 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 276830 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 276834 NAC domain 276837 PH domain. PH stands for pleckstrin homology 276841 Cyclophilin type peptidyl-prolyl cis-trans isomerase 276843 Hsp90 protein 276858 7 transmembrane receptor (rhodopsin family) 276860 7 transmembrane receptor (rhodopsin family) 276864 7 transmembrane receptor (rhodopsin family) 276865 7 transmembrane receptor (rhodopsin family) 276872 Respiratory-chain NADH dehydrogenase, 30 Kd subunit 276877 7 transmembrane receptor (rhodopsin family) 276889 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 276905 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 276905 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 276925 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 276929 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 276929 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 276931 7 transmembrane receptor (rhodopsin family) 276941 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 276995 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 276995 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 276998 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 277001 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 277015 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 277015 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 277016 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 277016 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 277018 GTPase of unknown function 277018 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 277018 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 277027 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 277028 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277035 Protein kinase domain 277041 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 277046 L1 transposable element 277046 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 277047 L1 transposable element 277047 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 277049 Eukaryotic initiation factor 1A 277051 L1 transposable element 277051 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 277069 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277089 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 277097 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 277099 Core histone H2A/H2B/H3/H4 277099 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 277103 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 277104 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 277106 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 277108 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 277111 Core histone H2A/H2B/H3/H4 277125 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 277125 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 277128 ENV polyprotein (coat polyprotein) 277131 Domain of unknown function 277131 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 277135 ENV polyprotein (coat polyprotein) 277135 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277136 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277137 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277145 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277147 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277148 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277151 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277154 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277160 7 transmembrane receptor (metabotropic glutamate family) 277161 ENV polyprotein (coat polyprotein) 277161 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 277161 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 277161 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 277161 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277164 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 277170 ENV polyprotein (coat polyprotein) 277183 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 277193 ENV polyprotein (coat polyprotein) 277200 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277225 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 277225 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 277226 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 277226 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 277226 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 277231 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 277231 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277231 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 277231 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277231 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277234 L1 transposable element 277234 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 277236 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277236 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277250 jmjC domain 277258 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 277270 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277270 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277274 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277274 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277278 L1 transposable element 277278 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 277281 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277281 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277283 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277283 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277288 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 277313 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 277318 ENV polyprotein (coat polyprotein) 277328 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 277328 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 277328 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 277328 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 277333 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 277333 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 277343 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 277378 ENV polyprotein (coat polyprotein) 277383 7 transmembrane receptor (rhodopsin family) 277389 Cyclophilin type peptidyl-prolyl cis-trans isomerase 277395 Putative zinc finger in N-recognin 277396 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 277396 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 277396 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 277419 7 transmembrane receptor (rhodopsin family) 277481 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 277482 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 277482 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 277489 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 277494 Ribosomal protein L15 277498 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 277511 7 transmembrane receptor (rhodopsin family) 277561 7 transmembrane receptor (rhodopsin family) 277562 7 transmembrane receptor (rhodopsin family) 277566 7 transmembrane receptor (rhodopsin family) 277567 7 transmembrane receptor (rhodopsin family) 277577 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 277603 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 277617 Ribosomal protein L21e 277618 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 277618 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277618 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 277618 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277636 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 277637 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 277657 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 277692 Ribosomal protein S19e 277704 Protein kinase domain 277704 Fibronectin type III domain 277704 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 277704 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 277709 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 277709 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277709 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277709 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 277752 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 277753 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 277753 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 277754 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 277757 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 277757 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277768 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 277771 Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 277771 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 277785 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 277785 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277785 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 277785 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277786 Respiratory-chain NADH dehydrogenase, 30 Kd subunit 277792 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 277792 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 277792 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 277799 Eukaryotic-type carbonic anhydrase 277807 ADP-ribosylation factor family 277807 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 277819 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 277819 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277819 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 277819 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277820 ENV polyprotein (coat polyprotein) 277830 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 277837 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277837 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277854 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 277856 Mitochondrial carrier protein 277858 NAD-dependent glycerol-3-phosphate dehydrogenase 277860 Zinc finger, C3HC4 type (RING finger) 277860 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 277860 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 277868 Actin 277872 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 277879 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277879 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277881 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 277885 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 277885 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277885 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 277885 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277898 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 277905 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 277910 Lectin C-type domain. This family includes both long and short form C-type 277916 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 277916 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 277916 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 277916 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 277922 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277922 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277923 L1 transposable element 277923 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 277923 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277923 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277924 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277925 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277925 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277935 7 transmembrane receptor (rhodopsin family) 277943 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 277949 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 277949 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 277955 Ribosomal protein L21e 277956 Hsp90 protein 277973 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 277979 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 277982 L1 transposable element 277982 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 277982 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 277982 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 277990 7 transmembrane receptor (rhodopsin family) 278002 ENV polyprotein (coat polyprotein) 278002 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 278020 F-actin capping protein alpha subunit 278028 L1 transposable element 278028 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 278037 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 278037 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 278041 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 278041 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 278041 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 278050 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 p 278061 ENV polyprotein (coat polyprotein) 278061 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 278062 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 278062 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 278067 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 278078 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 278078 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 278078 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 278078 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 278085 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 278087 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 278102 ENV polyprotein (coat polyprotein) 278103 HMG (high mobility group) box 278105 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 278125 BTG1 family. A novel family of anti-proliferative proteins 278129 Polyprenyl synthetase 278143 Ribosomal protein L19 278143 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 278143 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 278145 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 278148 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 278167 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 278170 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 278170 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 278174 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 278178 Pyruvate kinase, alpha/beta domain 278178 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 278178 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 278179 Eukaryotic porin 278179 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 278188 WD domain, G-beta repeat 278192 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 278194 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 278197 Aldose 1-epimerase 278203 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 278209 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 278211 ENV polyprotein (coat polyprotein) 278214 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 278215 Zinc finger, C3HC4 type (RING finger) 278216 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 278227 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 278232 C2 domain 278240 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 278244 L1 transposable element 278244 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 278244 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 278244 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 278248 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 278263 L1 transposable element 278263 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 278265 L1 transposable element 278265 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 278265 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 278265 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 278266 L1 transposable element 278267 L1 transposable element 278268 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 278269 Mitosis protein DIM1 278279 TPR Domain 278319 Ribosomal protein S28e 278350 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 278350 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 278365 Ribosomal protein S5, C-terminal domain 278417 Eukaryotic ribosomal protein L18 278420 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 278420 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 278420 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 278473 Uncharacterized ACR, COG1579 278473 Intermediate filament protein 278473 Myosin head (motor domain) 278473 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 278473 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 278473 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 278507 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 278507 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 278581 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 278583 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 278603 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 278616 WD domain, G-beta repeat 278672 Homeobox domain 278718 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 278725 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 278748 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 278748 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 278748 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 278748 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 278757 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 278759 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 278759 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 278759 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 278786 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 278791 Fibronectin type III domain 278817 ENV polyprotein (coat polyprotein) 278876 ENV polyprotein (coat polyprotein) 278876 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 278876 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 278876 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 278876 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 278915 ENV polyprotein (coat polyprotein) 278986 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 279013 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 279013 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 279028 Thrombospondin type 1 domain 279028 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 279029 Protein kinase domain 279034 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 279050 7 transmembrane receptor (rhodopsin family) 279056 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 279057 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 279067 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 279071 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 279137 7 transmembrane receptor (metabotropic glutamate family) 279137 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 279138 7 transmembrane receptor (metabotropic glutamate family) 279138 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 279139 Glycosyl hydrolases family 39 279139 7 transmembrane receptor (metabotropic glutamate family) 279140 7 transmembrane receptor (metabotropic glutamate family) 279140 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 279181 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 279181 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 279184 ADP-ribosylation factor family 279184 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 279230 Skp1 family, dimerisation domain 279230 Skp1 family, tetramerisation domain 279260 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 279333 Protein kinase domain 279387 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 279414 Ribosomal protein S8e 279421 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M 279429 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 279430 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 279432 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 279464 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 279464 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 279472 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 279475 HMG (high mobility group) box 279499 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 279499 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 279500 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 279553 Homeobox domain 279554 Homeobox domain 279569 Ribosomal protein L21e 279572 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 279578 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 279578 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 279610 Homeobox domain 279639 Ribosomal protein L6e 279639 Ribosomal protein L6, N-terminal domain 279639 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 279643 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 279653 Cadherin domain 279656 HMG (high mobility group) box 279658 Ribosomal protein S2 279679 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 279684 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 279686 ENV polyprotein (coat polyprotein) 279688 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 279691 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 279691 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f 279730 L1 transposable element 279730 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 279730 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 279730 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 279730 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 279766 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 279766 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 279796 7 transmembrane receptor (rhodopsin family) 279851 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 279857 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 279872 L1 transposable element 279872 Intermediate filament protein 279872 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 279872 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 279872 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 279872 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 279881 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 279882 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 279887 L1 transposable element 279887 Intermediate filament protein 279887 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 279887 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 279887 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 279896 Protein kinase domain 279919 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 279922 L1 transposable element 279922 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 279922 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 279923 L1 transposable element 279923 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 279923 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 279926 L1 transposable element 279926 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 279934 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 279934 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 279935 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 279939 Class I Histocompatibility antigen, domains alpha 1 and 2 279940 7 transmembrane receptor (rhodopsin family) 279942 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 279947 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 279947 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 279953 7 transmembrane receptor (rhodopsin family) 279963 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 279991 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 280026 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 280043 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 280046 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 280046 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 280047 Ribosomal protein L13e 280078 Smr domain. This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 protein in the case of Smr proteins and with the N-terminal MutS relate 280093 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 280093 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 280093 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 280095 L1 transposable element 280096 L1 transposable element 280096 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 280097 L1 transposable element 280099 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 280112 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 280118 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 280121 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 280123 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 280132 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 280132 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 280133 L1 transposable element 280133 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 280156 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 280184 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 280184 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 280188 GH3 auxin-responsive promoter 280188 Repeat in ubiquitin-activating (UBA) protein 280188 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 280205 Ribosomal protein L6 280219 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 280316 ENV polyprotein (coat polyprotein) 280412 Ribosomal protein L21e 280413 von Willebrand factor type A domain 280413 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 280415 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 280420 Ribosomal L18ae protein family 280444 ENV polyprotein (coat polyprotein) 280444 Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly 280444 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 280444 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 280444 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 280458 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 280458 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 280458 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u 280487 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 280487 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 280487 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 280487 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 280487 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 280487 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 280487 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 280487 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 280522 L1 transposable element 280522 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 280523 L1 transposable element 280523 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 280553 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 280553 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 280568 L1 transposable element 280568 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 280575 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 280645 Glycosyltransferase family 43 280662 Serum albumin family 280667 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 280667 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 280668 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 280668 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 280671 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 282546 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 282547 7 transmembrane receptor (rhodopsin family) 282581 7 transmembrane receptor (Secretin family) 282581 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 282582 wnt family 282587 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 282636 Inositol monophosphatase family 282708 Serum albumin family 282712 Sodium:neurotransmitter symporter family 282817 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 282819 Cyclophilin type peptidyl-prolyl cis-trans isomerase 282821 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 282821 Chitin synthase. Members of this family are fungal chitin synthase EC:2.4.1.16 enzymes. They catalyse chitin synthesis as follows: UDP-N-acetyl-D-glucosamine + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1) 282824 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 282825 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 282832 7 transmembrane receptor (rhodopsin family) 282832 7 transmembrane receptor (rhodopsin family) 282833 Zona pellucida-like domain 282834 Eukaryotic protein of unknown function, DUF279 282836 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 282836 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 282839 7 transmembrane receptor (rhodopsin family) 282892 7 transmembrane receptor (rhodopsin family) 282893 7 transmembrane receptor (rhodopsin family) 282894 7 transmembrane receptor (rhodopsin family) 282896 7 transmembrane receptor (rhodopsin family) 282897 7 transmembrane receptor (rhodopsin family) 282898 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 282898 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 282901 Ribosomal S17 282907 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 282910 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 282925 Ribosomal protein S2 282929 7 transmembrane receptor (rhodopsin family) 282930 7 transmembrane receptor (rhodopsin family) 282933 7 transmembrane receptor (rhodopsin family) 282934 7 transmembrane receptor (rhodopsin family) 282935 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 282935 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 282938 Ribosomal S17 282947 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 282959 Class II histocompatibility antigen, beta domain 282959 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 282978 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 282982 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 282985 Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex 282988 Ribosomal protein L13 282998 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 282998 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 283002 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 283002 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 283004 Prenyltransferase and squalene oxidase repeat 283011 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic 283011 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 283018 PH domain. PH stands for pleckstrin homology 283018 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 283018 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 283021 PH domain. PH stands for pleckstrin homology 283021 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 283021 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 283023 Prenyltransferase and squalene oxidase repeat 283037 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 283037 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 283039 Homeobox domain 283043 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 283043 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 283044 PH domain. PH stands for pleckstrin homology 283044 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 283054 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 283056 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 283082 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 283092 7 transmembrane receptor (rhodopsin family) 283093 7 transmembrane receptor (rhodopsin family) 283097 7 transmembrane receptor (rhodopsin family) 283100 7 transmembrane receptor (rhodopsin family) 283101 7 transmembrane receptor (rhodopsin family) 283102 Intermediate filament protein 283103 Ribosomal protein L23 283109 7 transmembrane receptor (rhodopsin family) 283110 7 transmembrane receptor (rhodopsin family) 283111 7 transmembrane receptor (rhodopsin family) 283114 S25 ribosomal protein 283115 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 283116 Zinc finger, C3HC4 type (RING finger) 283116 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 283117 Zinc finger, C3HC4 type (RING finger) 283117 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 283130 Mitochondrial carrier protein 283130 Mitochondrial carrier protein 283137 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 283150 Fork head domain 283155 Protein kinase domain 283158 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 283159 7 transmembrane receptor (rhodopsin family) 283160 7 transmembrane receptor (rhodopsin family) 283162 7 transmembrane receptor (rhodopsin family) 283176 Glycosyl hydrolases family 35 283180 NADH dehydrogenase 283181 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 283188 7 transmembrane receptor (rhodopsin family) 283189 7 transmembrane receptor (rhodopsin family) 283193 7 transmembrane receptor (rhodopsin family) 283198 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic 283202 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 283208 2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2 283210 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 283219 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 283222 Ribosomal protein S28e 283226 Protein prenyltransferase alpha subunit repeat 283238 Sugar (and other) transporter 283240 Sugar (and other) transporter 283247 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous 283248 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E 283256 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 283257 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 283258 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 283259 Zinc finger, C3HC4 type (RING finger) 283259 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 283264 P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function 283265 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch 283283 Mitochondrial carrier protein 283286 Formate--tetrahydrofolate ligase 283287 7 transmembrane receptor (rhodopsin family) 283288 7 transmembrane receptor (rhodopsin family) 283297 7 transmembrane receptor (rhodopsin family) 283298 Olfactomedin-like domain 283312 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran 283320 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 283321 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 283326 Lectin C-type domain. This family includes both long and short form C-type 283340 Ribosomal protein L13 283341 Adenylate kinase 283342 Core histone H2A/H2B/H3/H4 283343 Arginosuccinate synthase. This family contains a PP-loop motif 283345 Ribosomal protein L13e 283349 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 283349 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 283358 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 283365 7 transmembrane receptor (rhodopsin family) 283366 7 transmembrane receptor (rhodopsin family) 283367 7 transmembrane receptor (rhodopsin family) 283368 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 283375 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the 283377 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 283383 7 transmembrane receptor (Secretin family) 283398 ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity 283398 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 283415 Uncharacterized protein PaaI, COG2050 283415 Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins 283415 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-Co 283418 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 283419 Lectin C-type domain. This family includes both long and short form C-type 283420 Lectin C-type domain. This family includes both long and short form C-type 283421 Ribosomal protein L13 283425 Skp1 family, dimerisation domain 283425 Skp1 family, tetramerisation domain 283431 Growth-Arrest-Specific Protein 2 Domain 283455 Protein kinase domain 283458 Nucleoside diphosphate kinase 283459 Glu-tRNAGln amidotransferase C subunit. This is a family of Glu-tRNAGln amidotransferase C subunits. The Glu-tRNA Gln amidotransferase enzyme itself is an important translational fidelity mechanism replacing incorrectly charged Glu-tRNAGln with the correc 283464 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 283465 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 283465 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 283466 Ribosomal protein S27 283471 Trypsin 283471 Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are 283479 Ribosomal protein S26e 283490 Ribosomal protein L13 283493 7 transmembrane receptor (rhodopsin family) 283494 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 283509 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 283510 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 283510 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 283514 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 283518 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 283523 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 283529 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 283535 7 transmembrane receptor (rhodopsin family) 283539 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 283548 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f 283549 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 283558 Ribosomal protein L31e 283600 Mitochondrial carrier protein 283615 7 transmembrane receptor (rhodopsin family) 283617 7 transmembrane receptor (rhodopsin family) 283618 7 transmembrane receptor (rhodopsin family) 283620 7 transmembrane receptor (rhodopsin family) 283621 7 transmembrane receptor (rhodopsin family) 283623 7 transmembrane receptor (rhodopsin family) 283632 7 transmembrane receptor (rhodopsin family) 283634 Ribosomal protein L23 283652 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells 283660 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 283675 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 283693 Actin 283694 7 transmembrane receptor (rhodopsin family) 283695 Ribosomal protein S8e 283698 Synaptogyrin. This family of proteins is distantly related to pfam01284 283704 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi 283706 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 283716 Cyclic nucleotide-binding domain 283716 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 283719 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi 283727 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 283741 Homeobox domain 283747 Skp1 family, tetramerisation domain 283748 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lyso 283748 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 283754 Protein kinase domain 283756 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 283801 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 283804 Disintegrin 283806 Protein kinase domain 283819 Ribosomal L15 283848 Carboxylesterase 283853 Uracil DNA glycosylase superfamily 283855 Tropomyosin 283870 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 283870 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 283871 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 283878 Acyl CoA binding protein 283903 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 283912 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 283922 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 283925 SCP-like extracellular protein. This domain is also found in prokaryotes 283926 Chaperonin 10 Kd subunit 283927 NUDIX domain 283940 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 283955 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 283955 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 283957 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 283964 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 283970 Pyridoxal-dependent decarboxylase conserved domain 283985 Fatty acid desaturase 283985 Fatty acid desaturase 284013 Vitelline membrane outer layer protein I (VOMI). VOMI binds tightly to ovomucin fibrils of the egg yolk membrane. The structure that consists of three beta-sheets forming Greek key motifs, which are related by an internal pseudo three-fold symmetry. Furt 284013 Vitelline membrane outer layer protein I (VOMI). VOMI binds tightly to ovomucin fibrils of the egg yolk membrane. The structure that consists of three beta-sheets forming Greek key motifs, which are related by an internal pseudo three-fold symmetry. Furth 284015 ATP synthase subunit C 284024 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 284037 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 284054 Phosphotyrosine interaction domain (PTB/PID) 284057 Phosphotyrosine interaction domain (PTB/PID) 284064 Ribosomal L29e protein family 284076 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 284078 Eukaryotic initiation factor 4E 284085 Intermediate filament protein 284086 Protein kinase domain 284087 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 284089 Intermediate filament protein 284100 14-3-3 protein 284111 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 284121 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 284125 ATP synthase A chain 284125 Cytochrome b(N-terminal)/b6/petB 284125 Cytochrome C oxidase subunit II, periplasmic domain 284144 Ribosomal protein L13e 284161 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b 284161 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 284162 Yippee putative zinc-binding protein 284164 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 284165 Clathrin adaptor complex small chain 284165 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 284166 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 284167 DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases 284195 Intermediate filament protein 284196 Intermediate filament protein 284200 Protein kinase domain 284201 Zinc finger, C3HC4 type (RING finger) 284202 Fatty acid desaturase 284203 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 284208 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 284210 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 284210 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 284212 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 284217 Laminin G domain 284217 Laminin B (Domain IV) 284217 Laminin N-terminal (Domain VI) 284217 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 284218 Mitochondrial carrier protein 284228 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 284229 Cyclophilin type peptidyl-prolyl cis-trans isomerase 284230 Ribosomal protein L44 284235 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 284249 L1 transposable element 284250 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 284250 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 284252 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 284265 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 284268 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling. 284270 Microtubule associated protein (MAP65/ASE1 family) 284287 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 284288 Ribosomal protein L24e 284293 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 284295 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 284299 Protein kinase domain 284301 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 284302 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 284302 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 284307 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 284312 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 284315 Ribosomal protein L23 284318 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 284319 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 284320 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 284323 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 284335 PH domain. PH stands for pleckstrin homology 284341 A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation 284345 PH domain. PH stands for pleckstrin homology 284347 Phospholipase A2 inhibitor 284355 Homeobox domain 284357 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 284358 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 284358 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 284363 Dihydrodipicolinate reductase 284363 Domain of unknown function DUF108. This family has no known function. It is found to compose the complete protein in archaebacteria and a single domain in a large C. elegans protein 284383 7 transmembrane receptor (rhodopsin family) 284387 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 284390 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 284391 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 284393 Ribosomal L10 284393 Ribosomal protein L16 284427 Mitochondrial carrier protein 284431 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 284432 Ribosomal protein L23 284434 WD domain, G-beta repeat 284436 AMP-binding enzyme 284438 Citrate synthase 284439 Mitochondrial carrier protein 284457 Ribosomal protein S27 284458 Ribosomal protein S2 284459 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 284479 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 284486 Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins 284486 Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins 284500 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwee 284509 Helix-loop-helix DNA-binding domain 284515 Adenylate kinase 284521 7 transmembrane receptor (rhodopsin family) 284522 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 284525 Cyclic nucleotide-binding domain 284525 Cyclic nucleotide-binding domain 284525 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 284532 7 transmembrane receptor (rhodopsin family) 284533 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 284535 7 transmembrane receptor (rhodopsin family) 284541 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 284545 HMG (high mobility group) box 284553 Triosephosphate isomerase 284556 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 284556 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 284556 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 284563 Cyclophilin type peptidyl-prolyl cis-trans isomerase 284597 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 284613 Cytochrome b561 284614 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 284615 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 284642 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 284651 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 284652 CUB domain 284652 Sushi domain (SCR repeat) 284656 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 284659 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 284659 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 284674 Occludin/ELL family 284675 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 284676 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 284676 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 284679 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 284683 HMG (high mobility group) box 284684 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 284685 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 284689 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 284689 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 284690 Ubiquitin carboxyl-terminal hydrolase, family 1 284693 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 284696 XPG N-terminal domain 284697 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 284710 Ribosomal L29 protein 284711 Ribosomal protein S7e 284725 Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formatio 284759 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 284766 Ribosomal family S4e 284766 S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea 284767 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 284774 Ribosomal protein S17 284779 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 284810 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 284813 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling. 284819 Mitochondrial glycoprotein. This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q. It is thought to be involved in mitochondrial oxidative phosphorylation and in nucleus-mitochondrion inter 284820 Polyprenyl synthetase 284821 Ribosomal protein L13 284821 ssDNA binding protein. This protein is found in herpesviruses and is needed for replication 284832 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 284833 Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown 284843 Ribosomal protein L23 284849 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 284866 Proline dehydrogenase 284890 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 284897 Ribosomal L15 284904 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 284904 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 284905 Ribosomal S17 284918 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 284918 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 284922 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 284942 Ribosomal protein L23 284948 SH2 domain 284964 pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I 284967 Trypsin 284988 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 284996 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 285003 Ribosomal protein L36 285004 Ribosomal L39 protein 285005 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 285010 Dihydrofolate reductase 285011 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 285031 HMG (high mobility group) box 285035 Gamma-glutamyltranspeptidase 285044 Adenylate kinase 285046 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 285053 Ribosomal L18ae protein family 285055 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 285062 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 285089 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 285099 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 285107 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 285109 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 285112 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 285117 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe 285133 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 285148 Lipase/Acylhydrolase with GDSL-like motif 285164 Cytochrome c oxidase subunit III 285176 Ribosomal L10 285189 PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some t 285190 TPR Domain 285190 RanBP1 domain 285190 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 285192 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 285193 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 285195 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 285207 Ribosomal protein L36e 285208 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 285208 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 285214 Ribosomal protein L23 285216 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 285218 Ribosomal L18ae protein family 285227 HMG (high mobility group) box 285233 Intermediate filament protein 285234 Intermediate filament protein 285241 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 285242 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 285242 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 285242 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 285242 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 285244 Hsp90 protein 285245 Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars 285246 Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer 285260 Ribosomal protein L31e 285268 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 285269 Ribosomal protein S27 285280 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 285289 Hsp90 protein 285289 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 285292 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 285301 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 285313 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 285320 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 285321 Tropomyosin 285329 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 285329 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 285335 Cyclic nucleotide-binding domain 285335 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 285335 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 285350 Ribosomal protein S24e 285358 Homeobox domain 285358 PBX domain. The PBX domain is a bipartite acidic domain 285361 Ribosomal protein S2 285362 Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown 285362 Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown 285373 Protein kinase domain 285384 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 285407 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 285410 Ribosomal protein L21e 285412 HMG (high mobility group) box 285426 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 285433 PMP-22/EMP/MP20/Claudin family 285434 Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway 285440 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 285442 PCRF domain. This domain is found in peptide chain release factors 285442 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons 285449 Ribosomal protein S12 285453 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 285458 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 285466 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 285484 HMG (high mobility group) box 285507 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 285519 UDP-glucoronosyl and UDP-glucosyl transferase 285519 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 285536 Ribosomal protein L21e 285537 Ribosomal protein S27 285544 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 285554 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 285554 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 285554 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 285562 Core histone H2A/H2B/H3/H4 285590 PX domain. PX domains bind to phosphoinositides 285590 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 285598 ADP-ribosylation factor family 285605 DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after 285615 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 285625 S25 ribosomal protein 285641 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 285641 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 285643 Kinesin motor domain 285643 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 285655 Ribosomal L10 285658 Ribosomal protein L19e 285659 7 transmembrane receptor (rhodopsin family) 285669 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 285671 Zinc finger, C3HC4 type (RING finger) 285676 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 285702 Actin 285703 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 285720 Ribosomal S3Ae family 285728 HMG (high mobility group) box 285732 14-3-3 protein 285737 7 transmembrane receptor (rhodopsin family) 285741 Translationally controlled tumor protein 285748 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 285752 Ribosomal protein L3 285755 Cyclophilin type peptidyl-prolyl cis-trans isomerase 285761 LCCL domain 285761 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 285770 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 285783 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 285785 Ribosomal L39 protein 285787 Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in 285788 Protein kinase domain 285802 Zinc-binding dehydrogenase 285809 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 285829 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 285840 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 285845 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 285846 Ribosomal protein S8 285848 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o 285849 Cytochrome c oxidase subunit VIa 285850 Ribosomal protein L14p/L23e 285855 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 285865 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 285895 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 285897 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 285900 Ribosomal protein L6, N-terminal domain 285910 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 285911 Ribosomal protein L31e 285918 Kinesin motor domain 285926 DNA mismatch repair protein, C-terminal domain. This family represents the C-terminal domain of the mutL/hexB/PMS1 family. This domain has a ribosomal S5 domain 2-like fold 285929 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 285932 Ribosomal protein L44 285934 Helix-loop-helix DNA-binding domain 285938 Ribosomal protein S26e 285940 Protein kinase domain 285967 Ribosomal protein L44 285969 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 285984 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 285985 Ribosomal protein S27 286000 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 286016 Triosephosphate isomerase 286017 Protein kinase domain 286020 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 286061 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 286066 Asparagine synthase. This family is always found associated with pfam00310. Members of this family catalyse the conversion of aspartate to asparagine 286088 sic protein. Serotype M1 group A Streptococcus strains cause epidemic waves of human infections. This family includes the sic protein an extracellular protein (streptococcal inhibitor of complement) that inhibits human complement 286091 Ribosomal protein S26e 286102 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 286106 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 286136 Protein kinase domain 286145 Ribosomal family S4e 286150 Ribosomal protein S5, C-terminal domain 286150 Ribosomal protein S5, N-terminal domain 286151 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 286153 Ribosomal protein S26e 286157 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 286172 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 286181 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 286192 Hsp90 protein 286220 Helix-loop-helix DNA-binding domain 286222 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 286222 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 286228 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 286239 Malic enzyme, N-terminal domain 286240 Intermediate filament protein 286246 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 286246 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 286255 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 286256 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 286256 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 286310 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 286346 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 286346 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 286348 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 286360 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 286362 7 transmembrane receptor (rhodopsin family) 286364 7 transmembrane receptor (rhodopsin family) 286365 7 transmembrane receptor (rhodopsin family) 286380 Fork head domain 286397 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 286398 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 286400 Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 286400 Proliferating cell nuclear antigen, N-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 286402 Putative undecaprenyl diphosphate synthase. Previously known as uncharacterized protein family UPF0015, a single member of this family has been identified as an undecaprenyl diphosphate synthase 286409 Eukaryotic porin 286420 ADP-ribosylation factor family 286423 Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round 286425 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains 286425 Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF03727. Some members of the family have two copies of each of these domains 286426 Eukaryotic porin 286428 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 286429 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in 286436 Core histone H2A/H2B/H3/H4 286436 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 286444 Ribosomal protein S5, C-terminal domain 286444 Ribosomal protein S5, N-terminal domain 286454 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 286462 Cyclin-dependent kinase regulatory subunit 286468 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe 286469 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 286472 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 286474 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 286474 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 286479 Glutathione peroxidase 286480 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 286482 Hsp90 protein 286486 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 286487 14-3-3 protein 286488 Protein kinase domain 286489 AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d 286494 Protein kinase domain 286500 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 286502 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 286503 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 286512 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 286513 Ribosomal protein S26e 286514 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of 286516 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 286517 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 286517 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 286526 ADP-ribosylation factor family 286526 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 286530 7 transmembrane receptor (rhodopsin family) 286530 7 transmembrane receptor (rhodopsin family) 286532 MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases 286535 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 286539 Ribosomal protein S26e 286544 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 286546 Sulfatase 286555 WD domain, G-beta repeat 286590 Ribosomal protein L23 286653 F-box domain 286662 Eukaryotic initiation factor 4E 286722 Protein kinase domain 286749 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 286749 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep 286758 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 286759 Casein 286887 Intermediate filament protein 286888 WAP-type (Whey Acidic Protein) 'four-disulfide core' 286889 Somatotropin hormone family 286890 Tropomyosin 286892 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 286892 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 286893 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 286896 Aminotransferase class-V 286896 Aminotransferase class I and II 286896 Pyridoxal-dependent decarboxylase conserved domain 286898 ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r 286898 ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen re 286901 Giardia variant-specific surface protein 286901 Giardia variant-specific surface protein 286901 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl 286904 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 286904 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 286905 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 286905 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 286908 LIM domain. This family represents two copies of the LIM structural domain 286908 LIM domain. This family represents two copies of the LIM structural domain 286908 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 286911 Trypsin 286912 Intermediate filament protein 286913 7 transmembrane receptor (rhodopsin family) 286913 7 transmembrane receptor (rhodopsin family) 286914 7 transmembrane receptor (metabotropic glutamate family) 286914 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286914 7 transmembrane receptor (metabotropic glutamate family) 286914 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286915 7 transmembrane receptor (metabotropic glutamate family) 286915 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286916 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 286918 Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169 286920 Amiloride-sensitive sodium channel 286920 Amiloride-sensitive sodium channel 286921 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 286921 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 286922 Fatty acid desaturase 286922 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 286922 Fatty acid desaturase 286922 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 286923 Guanylate-kinase-associated protein (GKAP) protein 286923 Guanylate-kinase-associated protein (GKAP) protein 286924 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 286924 ADP-ribosylation factor family 286924 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 286925 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 286926 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 286927 Protein kinase domain 286927 Protein kinase domain 286929 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 286930 Guanylate-kinase-associated protein (GKAP) protein 286930 Guanylate-kinase-associated protein (GKAP) protein 286931 C2 domain 286931 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 286933 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 286933 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 286936 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase 286938 GTPase of unknown function 286940 Filamin/ABP280 repeat 286953 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 286954 UDP-glucoronosyl and UDP-glucosyl transferase 286954 UDP-glucoronosyl and UDP-glucosyl transferase 286954 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 286955 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 286956 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 286956 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 286957 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 286957 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 286958 Mammalian defensin 286958 Defensin propeptide 286958 Mammalian defensin 286958 Defensin propeptide 286959 7 transmembrane receptor (rhodopsin family) 286959 7 transmembrane receptor (rhodopsin family) 286960 Trypsin 286960 Trypsin 286962 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 286963 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 286963 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 286964 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 286964 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 286966 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 286974 Sulfotransferase protein 286976 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo 286981 7 transmembrane receptor (metabotropic glutamate family) 286981 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286981 7 transmembrane receptor (metabotropic glutamate family) 286981 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286982 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286982 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286983 7 transmembrane receptor (metabotropic glutamate family) 286983 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286983 7 transmembrane receptor (metabotropic glutamate family) 286983 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286984 7 transmembrane receptor (metabotropic glutamate family) 286984 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286984 7 transmembrane receptor (metabotropic glutamate family) 286984 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286985 7 transmembrane receptor (metabotropic glutamate family) 286985 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286986 7 transmembrane receptor (metabotropic glutamate family) 286986 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286986 7 transmembrane receptor (metabotropic glutamate family) 286986 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 286989 UDP-glucoronosyl and UDP-glucosyl transferase 286990 HMG (high mobility group) box 286990 HMG (high mobility group) box 286991 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated 286994 7 transmembrane receptor (rhodopsin family) 286995 Defensin propeptide 286995 Mammalian defensin 286995 Defensin propeptide 286996 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 286997 Protein kinase domain 286998 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 286998 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287000 7 transmembrane receptor (rhodopsin family) 287000 7 transmembrane receptor (rhodopsin family) 287001 7 transmembrane receptor (rhodopsin family) 287001 7 transmembrane receptor (rhodopsin family) 287002 7 transmembrane receptor (rhodopsin family) 287002 7 transmembrane receptor (rhodopsin family) 287006 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 287006 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 287006 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 287006 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 287029 Ribosomal protein L31e 287031 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 287042 ParA family ATPase 287042 4Fe-4S iron sulfur cluster binding proteins, NifH/frxC family 287046 14-3-3 protein 287049 PMP-22/EMP/MP20/Claudin family 287051 Ribosomal protein L6 287055 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 287056 ribosomal L5P family C-terminus. This region is found associated with pfam00281 287060 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 287060 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 287065 Uncharacterised protein family (UPF0108) 287068 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 287069 Hsp90 protein 287070 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 287074 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 287075 Thymosin beta-4 family 287077 7 transmembrane receptor (rhodopsin family) 287079 7 transmembrane receptor (rhodopsin family) 287080 7 transmembrane receptor (rhodopsin family) 287081 7 transmembrane receptor (rhodopsin family) 287082 7 transmembrane receptor (rhodopsin family) 287083 7 transmembrane receptor (rhodopsin family) 287084 7 transmembrane receptor (rhodopsin family) 287085 7 transmembrane receptor (rhodopsin family) 287086 7 transmembrane receptor (rhodopsin family) 287087 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 287088 7 transmembrane receptor (rhodopsin family) 287089 7 transmembrane receptor (rhodopsin family) 287090 7 transmembrane receptor (rhodopsin family) 287090 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 287091 7 transmembrane receptor (rhodopsin family) 287092 Ribosomal family S4e 287093 7 transmembrane receptor (rhodopsin family) 287095 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 287095 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 287096 Ribosomal protein L44 287098 PMP-22/EMP/MP20/Claudin family 287099 PMP-22/EMP/MP20/Claudin family 287102 CUB domain 287102 WSC domain. This domain may be involved in carbohydrate binding 287102 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 287104 Trypsin 287105 S25 ribosomal protein 287106 Trypsin 287107 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 287108 Trypsin 287109 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 287110 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 287114 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 287115 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 287120 WD domain, G-beta repeat 287123 SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad 287125 ParA family ATPase 287125 4Fe-4S iron sulfur cluster binding proteins, NifH/frxC family 287125 ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems 287125 Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell 287130 Pentaxin family. Pentaxins are also known as pentraxins 287136 Trypsin 287137 Trypsin 287138 Trypsin 287140 Trypsin 287141 Trypsin 287142 Trypsin 287143 Trypsin 287145 RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to 287148 RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to 287155 TPR Domain 287160 Calpain family cysteine protease 287161 Ribosomal protein L34e 287162 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 287164 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 287165 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 287167 Globin 287168 Globin 287169 Uncharacterised protein family (UPF0171) 287172 Ribosomal protein L35Ae 287174 Ribosomal Proteins L2, C-terminal domain 287178 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287180 Protein kinase domain 287182 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 287185 Fork head domain 287188 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 287189 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 287191 tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase 287195 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 287199 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 287200 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 287203 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 287203 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 287204 S25 ribosomal protein 287206 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 287207 Ribosomal protein S5, C-terminal domain 287207 Ribosomal protein S5, N-terminal domain 287208 TPR Domain 287209 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 287215 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 287223 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 287227 Mitosis protein DIM1 287227 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 287228 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 287229 CUB domain 287230 Thymosin beta-4 family 287232 7 transmembrane receptor (rhodopsin family) 287234 7 transmembrane receptor (rhodopsin family) 287235 Ribosomal protein L31e 287236 7 transmembrane receptor (rhodopsin family) 287237 7 transmembrane receptor (rhodopsin family) 287239 7 transmembrane receptor (rhodopsin family) 287240 7 transmembrane receptor (rhodopsin family) 287241 7 transmembrane receptor (rhodopsin family) 287242 7 transmembrane receptor (rhodopsin family) 287243 7 transmembrane receptor (rhodopsin family) 287244 7 transmembrane receptor (rhodopsin family) 287245 7 transmembrane receptor (rhodopsin family) 287245 1,3-beta-glucan synthase component. This family consists of various 1,3-beta-glucan synthase components including Gls1, Gls2 and Gls3 from yeast. 1,3-beta-glucan synthase EC:2.4.1.34 also known as callose synthase catalyses the formation of a beta-1,3-glu 287246 7 transmembrane receptor (rhodopsin family) 287247 7 transmembrane receptor (rhodopsin family) 287248 Ribosomal protein S28e 287249 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 287250 Ribosomal L29e protein family 287251 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 287252 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 287254 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 287255 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 287258 7 transmembrane receptor (rhodopsin family) 287259 7 transmembrane receptor (rhodopsin family) 287260 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 287260 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 287261 7 transmembrane receptor (rhodopsin family) 287262 7 transmembrane receptor (rhodopsin family) 287263 7 transmembrane receptor (rhodopsin family) 287264 7 transmembrane receptor (rhodopsin family) 287265 7 transmembrane receptor (rhodopsin family) 287266 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 287268 Protein kinase domain 287269 Protein kinase domain 287270 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 287273 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 287274 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 287276 ADP-ribosylation factor family 287276 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 287276 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 287277 Proteasome A-type and B-type 287280 Skp1 family, dimerisation domain 287280 Skp1 family, tetramerisation domain 287281 Eukaryotic porin 287282 Ribosomal protein S5, N-terminal domain 287283 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 287284 Ribosomal L10 287287 Interleukin 4 287291 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 287293 HMG (high mobility group) box 287300 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 287302 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 287305 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 287307 'Cold-shock' DNA-binding domain 287309 7 transmembrane receptor (rhodopsin family) 287310 7 transmembrane receptor (rhodopsin family) 287311 7 transmembrane receptor (rhodopsin family) 287312 7 transmembrane receptor (rhodopsin family) 287313 7 transmembrane receptor (rhodopsin family) 287314 7 transmembrane receptor (rhodopsin family) 287315 7 transmembrane receptor (rhodopsin family) 287316 7 transmembrane receptor (rhodopsin family) 287317 7 transmembrane receptor (rhodopsin family) 287318 7 transmembrane receptor (rhodopsin family) 287319 7 transmembrane receptor (rhodopsin family) 287320 7 transmembrane receptor (rhodopsin family) 287321 7 transmembrane receptor (rhodopsin family) 287322 7 transmembrane receptor (rhodopsin family) 287323 7 transmembrane receptor (rhodopsin family) 287324 7 transmembrane receptor (rhodopsin family) 287325 7 transmembrane receptor (rhodopsin family) 287326 7 transmembrane receptor (rhodopsin family) 287328 7 transmembrane receptor (rhodopsin family) 287329 7 transmembrane receptor (rhodopsin family) 287330 7 transmembrane receptor (rhodopsin family) 287331 7 transmembrane receptor (rhodopsin family) 287333 7 transmembrane receptor (rhodopsin family) 287334 7 transmembrane receptor (rhodopsin family) 287335 7 transmembrane receptor (rhodopsin family) 287336 7 transmembrane receptor (rhodopsin family) 287337 7 transmembrane receptor (rhodopsin family) 287338 Enolase, N-terminal domain 287338 Enolase, C-terminal TIM barrel domain 287339 7 transmembrane receptor (rhodopsin family) 287340 7 transmembrane receptor (rhodopsin family) 287341 7 transmembrane receptor (rhodopsin family) 287342 7 transmembrane receptor (rhodopsin family) 287343 7 transmembrane receptor (rhodopsin family) 287344 7 transmembrane receptor (rhodopsin family) 287345 7 transmembrane receptor (rhodopsin family) 287347 7 transmembrane receptor (rhodopsin family) 287348 7 transmembrane receptor (rhodopsin family) 287349 7 transmembrane receptor (rhodopsin family) 287352 Core histone H2A/H2B/H3/H4 287353 Protein kinase domain 287354 Guanylate kinase 287354 Rad17 cell cycle checkpoint protein 287354 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 287354 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 287357 wnt family 287358 Trypsin 287359 jmjC domain 287360 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 287361 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 287361 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 287361 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 287362 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 287363 7 transmembrane receptor (rhodopsin family) 287364 Uncharacterized protein family UPF0023 287365 7 transmembrane receptor (rhodopsin family) 287367 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 287373 GTP1/OBG family 287373 TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae 287379 Serine hydroxymethyltransferase 287380 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 287382 Fibrinogen beta and gamma chains, C-terminal globular domain 287384 SH2 domain 287384 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 287385 Sodium:solute symporter family 287392 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 287393 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 287394 Sulfotransferase protein 287395 Sulfotransferase protein 287398 Protein kinase domain 287399 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 287400 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 287400 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 287401 Ribosomal L15 287402 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 287403 HIT family 287408 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 287409 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 287411 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 287416 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 287418 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 287423 Helix-loop-helix DNA-binding domain 287424 Lipoxygenase 287425 Lipoxygenase 287425 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 287426 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 287427 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 287429 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 287435 Lysosome-associated membrane glycoprotein (Lamp) 287436 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 287436 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 287437 TNF(Tumor Necrosis Factor) family 287438 Ribosomal protein L13e 287441 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 287442 Protein of unknown function (DUF423). Potential integral membrane protein 287444 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 287447 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 287448 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 287449 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 287454 Lipoxygenase 287459 Ribosomal protein L21e 287465 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 287466 Sulfotransferase protein 287466 WSC domain. This domain may be involved in carbohydrate binding 287467 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera 287468 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 287469 Fork head domain 287470 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 287473 Protein kinase domain 287478 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. Howe 287479 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 287480 7 transmembrane receptor (rhodopsin family) 287481 7 transmembrane receptor (rhodopsin family) 287482 7 transmembrane receptor (rhodopsin family) 287483 7 transmembrane receptor (rhodopsin family) 287484 7 transmembrane receptor (rhodopsin family) 287486 7 transmembrane receptor (rhodopsin family) 287487 7 transmembrane receptor (rhodopsin family) 287488 7 transmembrane receptor (rhodopsin family) 287489 7 transmembrane receptor (rhodopsin family) 287490 7 transmembrane receptor (rhodopsin family) 287491 7 transmembrane receptor (rhodopsin family) 287492 7 transmembrane receptor (rhodopsin family) 287493 7 transmembrane receptor (rhodopsin family) 287494 7 transmembrane receptor (rhodopsin family) 287495 7 transmembrane receptor (rhodopsin family) 287496 7 transmembrane receptor (rhodopsin family) 287497 7 transmembrane receptor (rhodopsin family) 287498 7 transmembrane receptor (rhodopsin family) 287500 7 transmembrane receptor (rhodopsin family) 287501 7 transmembrane receptor (rhodopsin family) 287502 7 transmembrane receptor (rhodopsin family) 287503 7 transmembrane receptor (rhodopsin family) 287504 7 transmembrane receptor (rhodopsin family) 287505 7 transmembrane receptor (rhodopsin family) 287506 7 transmembrane receptor (rhodopsin family) 287507 7 transmembrane receptor (rhodopsin family) 287508 7 transmembrane receptor (rhodopsin family) 287510 7 transmembrane receptor (rhodopsin family) 287511 7 transmembrane receptor (rhodopsin family) 287512 Ribosomal protein S6e 287513 7 transmembrane receptor (rhodopsin family) 287514 7 transmembrane receptor (rhodopsin family) 287515 7 transmembrane receptor (rhodopsin family) 287516 7 transmembrane receptor (rhodopsin family) 287517 7 transmembrane receptor (rhodopsin family) 287519 7 transmembrane receptor (rhodopsin family) 287520 7 transmembrane receptor (rhodopsin family) 287521 Helix-loop-helix DNA-binding domain 287522 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 287523 Domain of unknown function (DUF341) 287525 MYND finger 287526 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 287527 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 287530 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 287533 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 287537 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 287539 Ribosomal protein S2 287543 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 287546 Flavodoxin 287546 pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain 287546 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 287546 FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase 287547 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287551 Ribosomal protein L6 287552 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 287553 Ribosomal protein S19 287556 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 287557 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 287558 Ribosomal protein L35Ae 287559 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 287561 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 287562 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 287563 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 287565 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 287566 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 287567 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 287570 Growth-Arrest-Specific Protein 2 Domain 287570 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 287574 WAP-type (Whey Acidic Protein) 'four-disulfide core' 287575 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 287576 Ribosomal protein S8 287582 Homeobox domain 287583 Homeobox domain 287593 Domain of unknown function UPF0099. This domain has no known function 287601 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 287604 Ribosomal protein L21e 287606 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 287607 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 287609 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 287609 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 287609 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 287609 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 287610 Animal haem peroxidase 287611 Ribosomal protein S6e 287613 7 transmembrane receptor (rhodopsin family) 287614 7 transmembrane receptor (rhodopsin family) 287618 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287619 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287620 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287622 VHS domain. Domain present in VPS-27, Hrs and STAM 287622 GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins 287623 Ribosomal L29 protein 287624 Kinesin motor domain 287625 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 287625 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 287627 Eukaryotic-type carbonic anhydrase 287631 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 287631 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 287635 Ribosomal L27 protein 287636 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 287636 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 287638 Homeobox domain 287639 Ribosomal L28e protein family 287643 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 287644 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 287645 EAP30. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of tr 287646 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 287647 Homeobox domain 287648 PH domain. PH stands for pleckstrin homology 287650 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 287651 Ribosomal protein S26e 287658 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 287659 Hyaluronidase 287667 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 287670 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 287673 7 transmembrane receptor (rhodopsin family) 287675 Intermediate filament protein 287677 Intermediate filament protein 287678 Intermediate filament protein 287679 Intermediate filament protein 287685 Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int 287686 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 287691 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 287697 Intermediate filament protein 287697 Male specific sperm protein. This family of drosophila proteins are typified by the repetitive motif C-G-P 287697 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 287698 Intermediate filament protein 287699 Intermediate filament protein 287700 Intermediate filament protein 287701 Intermediate filament protein 287702 Intermediate filament protein 287703 Translation initiation factor SUI1 287704 FKBP-type peptidyl-prolyl cis-trans isomerase 287706 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 287707 ADP-ribosylation factor family 287707 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 287708 ADP-ribosylation factor family 287708 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 287709 ADP-ribosylation factor family 287709 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 287711 Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction 287712 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 287716 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 287716 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 287717 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 287720 Zinc-binding dehydrogenase 287721 Zinc-binding dehydrogenase 287723 Apoptosis regulator proteins, Bcl-2 family 287726 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 287729 Protein-tyrosine phosphatase 287729 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 287730 Pancreatic hormone peptide 287732 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 287735 Mitochondrial carrier protein 287737 Ribosomal protein L36e 287739 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 287740 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 287742 Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction 287745 C2 domain 287745 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 287745 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 287746 Formin Homology 2 Domain 287748 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 287748 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 287751 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 287753 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 287753 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 287758 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 287760 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 287763 Protein kinase domain 287764 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 287765 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 287765 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 287769 Actin 287770 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 287772 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 287774 Sushi domain (SCR repeat) 287775 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 287778 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 287782 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 287784 Ribosomal protein L23 287786 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 287792 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 287801 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 287801 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 287805 7 transmembrane receptor (metabotropic glutamate family) 287807 ENV polyprotein (coat polyprotein) 287808 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 287809 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 287809 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 287810 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 287811 Mitochondrial carrier protein 287814 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 287817 Ribosomal protein L6e 287819 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 287820 Protein of unknown function, DUF270 287821 Protein of unknown function, DUF270 287824 B-box zinc finger 287824 Zinc finger, C3HC4 type (RING finger) 287824 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 287825 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 287827 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 287829 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 287831 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 287833 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 287835 GHMP kinases putative ATP-binding protein 287837 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 287838 Protein kinase domain 287844 Ribosomal protein L11, RNA binding domain 287846 Eukaryotic ribosomal protein L18 287848 Myosin head (motor domain) 287849 Myosin head (motor domain) 287853 Myosin head (motor domain) 287859 Myosin head (motor domain) 287862 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 287863 Myosin head (motor domain) 287868 Sulfate transporter family. Mutations may lead to several human diseases 287868 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 287876 Actin 287877 Delta 1-pyrroline-5-carboxylate reductase 287877 NAD-dependent glycerol-3-phosphate dehydrogenase 287877 pfam02871, Octopine_DH_N, NAD/NADP octopine/nopaline dehydrogenase, NAD binding domain. This group of enzymes act on the CH-NH substrate bond using NAD(+) or NADP(+) as an acceptor. The Pfam family consists mainly of octopine and nopaline dehydrogenases f 287881 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 287888 7 transmembrane receptor (rhodopsin family) 287893 Fructosamine kinase. This family includes eukaryotic fructosamine-3-kinase enzymes. The family also includes bacterial members that have not been characterised but probably have a similar or identical function 287895 14-3-3 protein 287898 7 transmembrane receptor (metabotropic glutamate family) 287900 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 287902 Ribosomal L15 287903 HIT family 287905 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 287908 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 287909 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 287910 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 287911 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287912 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287913 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287915 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 287917 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 287919 Thymosin beta-4 family 287920 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 287923 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 287923 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 287928 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 287931 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 287933 Cyclophilin type peptidyl-prolyl cis-trans isomerase 287939 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 287940 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 287940 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 287941 SH2 domain 287942 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 287943 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 287945 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 287947 Ribosomal protein L31e 287950 Proline dehydrogenase 287953 Conserved hypothetical protein 95 287953 ubiE/COQ5 methyltransferase family 287953 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 287953 DNA methylase. Members of this family are DNA methylases. The family contains both N-4 cytosine-specific DNA methylases and N-6 Adenine-specific DNA methylases 287953 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 287953 Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity 287953 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 287955 NeuB family. NeuB is the prokaryotic N-acetylneuraminic acid (Neu5Ac) synthase. It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only 287960 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 287961 CDC45-like protein. CDC45 is an essential gene required for initiation of DNA replication in S. cerevisiae, forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human, mouse and smut fungus among others 287964 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 287965 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 287966 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 287968 7 transmembrane receptor (rhodopsin family) 287969 7 transmembrane receptor (rhodopsin family) 287970 7 transmembrane receptor (rhodopsin family) 287971 7 transmembrane receptor (rhodopsin family) 287972 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 287973 7 transmembrane receptor (rhodopsin family) 287974 Zn-finger in Ran binding protein and others 287974 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 287975 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 287979 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 287980 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 287981 Ribosomal protein S8 287982 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 287986 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 287988 RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits 287989 Protein kinase domain 287989 Fibronectin type III domain 287989 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 287989 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 287993 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287994 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 287995 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 287996 Ribosomal protein L21e 287997 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 287999 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288000 Core histone H2A/H2B/H3/H4 288001 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288002 Core histone H2A/H2B/H3/H4 288003 Rad17 cell cycle checkpoint protein 288003 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 288004 S-adenosyl-L-homocysteine hydrolase 288005 Ribosomal protein L21e 288006 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 288006 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288007 Trypsin 288007 CUB domain 288007 Sushi domain (SCR repeat) 288018 Poly-adenylate binding protein, unique domain 288018 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288019 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 288020 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 288020 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 288021 ENV polyprotein (coat polyprotein) 288024 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288024 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288026 E1-E2 ATPase 288026 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 288030 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 288036 Ribosomal protein L31e 288037 Cyclophilin type peptidyl-prolyl cis-trans isomerase 288039 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288039 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288040 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288042 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 288044 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 288046 Ribosomal protein L10 288047 Leishmanolysin 288051 Orotidine 5'-phosphate decarboxylase / HUMPS family. This family includes Orotidine 5'-phosphate decarboxylase enzymes EC:4.1.1.23 that are involved in the final step of pyrimidine biosynthesis. The family also includes enzymes such as hexulose-6-phosphat 288052 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 288052 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 288057 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 288059 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 288060 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 288063 ICE-like protease (caspase) p20 domain 288064 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 288066 Ribosomal protein S5, N-terminal domain 288067 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288068 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288070 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288071 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288072 Orbivirus VP3 (T2) protein. The orbivirus VP3 protein is part of the virus core and makes a 'subcore' shell made up of 120 copies of the 100K protein. VP3 particles can also bind RNA and are fundamental in the early stages of viral core formation. Also fo 288072 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288073 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 288074 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288075 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 288077 Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018 288077 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 288079 DNA polymerase family A 288079 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 288082 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 288083 homogentisate 1,2-dioxygenase. Homogentisate dioxygenase cleaves the aromatic ring during the metabolic degradation of Phe and Tyr. Homogentisate dioxygenase deficiency causes alkaptonuria. The structure of homogentisate dioxygenase shows that the enzyme 288084 TFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits TFIIE-alpha and TFIIE-beta and joins t 288085 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 288086 homogentisate 1,2-dioxygenase. Homogentisate dioxygenase cleaves the aromatic ring during the metabolic degradation of Phe and Tyr. Homogentisate dioxygenase deficiency causes alkaptonuria. The structure of homogentisate dioxygenase shows that the enzyme 288087 Intermediate filament protein 288087 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 288089 Ribosomal protein S17 288095 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 288096 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 288097 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288097 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288098 Cyclophilin type peptidyl-prolyl cis-trans isomerase 288099 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 288100 Ribosomal protein L21e 288103 Protein kinase domain 288106 7 transmembrane receptor (rhodopsin family) 288108 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 288113 GTPase of unknown function 288114 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288115 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 288117 Cyclic nucleotide-binding domain 288117 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 288118 Trypsin 288122 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 288126 6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each 288127 NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold 288127 6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each 288129 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 288130 Ribosomal protein L10 288132 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 288134 Ribosomal protein L6 288136 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 288137 Myosin head (motor domain) 288138 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 288140 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288140 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288144 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 288145 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 288147 Eukaryotic porin 288149 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 288151 Homeobox domain 288152 Homeobox domain 288153 Homeobox domain 288154 Homeobox domain 288155 Homeobox domain 288156 Ribosomal protein S26e 288158 Ribosomal protein S7e 288159 HMG (high mobility group) box 288160 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 288161 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 288162 Ribosomal protein L31e 288163 DNA directed RNA polymerase, 7 kDa subunit 288167 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 288171 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 288172 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 288173 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 288174 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 288175 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo 288176 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 288178 Protein kinase domain 288179 Translationally controlled tumor protein 288179 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288180 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 288181 7 transmembrane receptor (rhodopsin family) 288182 PMP-22/EMP/MP20/Claudin family 288183 7 transmembrane receptor (rhodopsin family) 288184 7 transmembrane receptor (rhodopsin family) 288185 7 transmembrane receptor (rhodopsin family) 288186 7 transmembrane receptor (rhodopsin family) 288187 7 transmembrane receptor (rhodopsin family) 288187 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288188 7 transmembrane receptor (rhodopsin family) 288189 7 transmembrane receptor (rhodopsin family) 288190 7 transmembrane receptor (rhodopsin family) 288191 7 transmembrane receptor (rhodopsin family) 288192 7 transmembrane receptor (rhodopsin family) 288193 7 transmembrane receptor (rhodopsin family) 288194 7 transmembrane receptor (rhodopsin family) 288195 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 288196 7 transmembrane receptor (rhodopsin family) 288197 7 transmembrane receptor (rhodopsin family) 288198 7 transmembrane receptor (rhodopsin family) 288199 7 transmembrane receptor (rhodopsin family) 288200 7 transmembrane receptor (rhodopsin family) 288201 7 transmembrane receptor (rhodopsin family) 288202 7 transmembrane receptor (rhodopsin family) 288203 7 transmembrane receptor (rhodopsin family) 288204 QXW lectin repeat 288205 Ribosomal protein L36e 288207 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 288207 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 288207 Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains. 288208 Fibronectin type III domain 288209 Fibronectin type III domain 288212 Ribosomal protein L34e 288215 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 288215 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 288216 Phosphoglycerate kinase 288217 Eukaryotic initiation factor 1A 288218 Eukaryotic initiation factor 1A 288219 Ribosomal protein L19e 288220 Ribosomal protein L19e 288221 NOL1/NOP2/sun family 288226 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288230 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 288235 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 288241 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 288243 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 288246 Actinobacillus constitutively-expressed outer membrane lipoprotein A 288246 Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element 288246 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 288251 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 288256 Zinc-binding dehydrogenase 288258 Phosphoribosylglycinamide synthetase, C domain. Phosphoribosylglycinamide synthetase catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by Phosphoribosylglycinamide synthetase is the ATP- dependent addition of 5-phosph 288259 Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate defor 288260 Ribosomal protein S7e 288265 Ribosomal protein S7e 288267 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 288276 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288280 Fibronectin type III domain 288280 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 288281 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 288282 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 288283 Mitochondrial carrier protein 288285 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 288286 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 288287 CUB domain 288287 MAM domain. An extracellular domain found in many receptors 288287 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 288288 CUB domain 288288 MAM domain. An extracellular domain found in many receptors 288288 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 288289 Lectin C-type domain. This family includes both long and short form C-type 288290 Trypsin 288291 Trypsin 288293 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 288294 Enolase, N-terminal domain 288294 Enolase, C-terminal TIM barrel domain 288301 Protein-tyrosine phosphatase 288301 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 288304 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 288305 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 288306 Ubiquitin carboxyl-terminal hydrolase family 2 288307 Ubiquitin carboxyl-terminal hydrolases family 2 288309 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 288310 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288311 Mitochondrial carrier protein 288312 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 288312 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 288313 Ribosomal L29e protein family 288314 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 288314 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 288316 Zinc-binding dehydrogenase 288317 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288318 Ribosomal L29e protein family 288321 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288322 Lipase 288323 HMG (high mobility group) box 288328 Eukaryotic initiation factor 4E 288330 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 288332 pfam02922, isoamylase_N, Isoamylase N-terminal domain. This domain is found in a range of enzymes that act on branched substrates - isoamylase, pullulanase and branching enzyme. This family also contains the beta subunit of 5' AMP activated kinase 288333 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str 288340 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 288340 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 288344 Eukaryotic porin 288345 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 288346 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 288347 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 288348 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 288349 Eukaryotic protein of unknown function, DUF279 288352 RNA polymerase Rpb5, C-terminal domain. The assembly domain of Rpb5. The archaeal equivalent to this domain is subunit H. Subunit H lacks the N-terminal domain 288355 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 288356 Homeobox domain 288358 Protein kinase domain 288360 Eukaryotic-type carbonic anhydrase 288362 Myosin head (motor domain) 288364 Queuine tRNA-ribosyltransferase. This is a family of queuine tRNA-ribosyltransferases EC:2.4.2.29, also known as tRNA-guanine transglycosylase and guanine insertion enzyme. Queuine tRNA-ribosyltransferase modifies tRNAs for asparagine, aspartic acid, hist 288366 Ribosomal protein L21e 288367 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 288375 Lectin C-type domain. This family includes both long and short form C-type 288376 Lectin C-type domain. This family includes both long and short form C-type 288377 Lectin C-type domain. This family includes both long and short form C-type 288378 Lectin C-type domain. This family includes both long and short form C-type 288379 Lectin C-type domain. This family includes both long and short form C-type 288381 Lectin C-type domain. This family includes both long and short form C-type 288383 Lectin C-type domain. This family includes both long and short form C-type 288384 Lectin C-type domain. This family includes both long and short form C-type 288385 Lectin C-type domain. This family includes both long and short form C-type 288386 Lectin C-type domain. This family includes both long and short form C-type 288387 Transcription initiation factor TFIID subunit A 288387 Lectin C-type domain. This family includes both long and short form C-type 288388 Lectin C-type domain. This family includes both long and short form C-type 288389 Glutamyl-tRNAGlu reductase. This family use NADPH as a cofactor 288389 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 288391 Zona pellucida-like domain 288394 7 transmembrane receptor (metabotropic glutamate family) 288395 7 transmembrane receptor (metabotropic glutamate family) 288395 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288396 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288397 7 transmembrane receptor (metabotropic glutamate family) 288398 Poly-adenylate binding protein, unique domain 288399 7 transmembrane receptor (metabotropic glutamate family) 288401 7 transmembrane receptor (metabotropic glutamate family) 288402 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288403 7 transmembrane receptor (metabotropic glutamate family) 288404 'chromo' (CHRromatin Organization MOdifier) domain 288405 'chromo' (CHRromatin Organization MOdifier) domain 288406 'chromo' (CHRromatin Organization MOdifier) domain 288407 'chromo' (CHRromatin Organization MOdifier) domain 288408 'chromo' (CHRromatin Organization MOdifier) domain 288409 'chromo' (CHRromatin Organization MOdifier) domain 288410 'chromo' (CHRromatin Organization MOdifier) domain 288412 'chromo' (CHRromatin Organization MOdifier) domain 288413 'chromo' (CHRromatin Organization MOdifier) domain 288415 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 288419 'chromo' (CHRromatin Organization MOdifier) domain 288420 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 288421 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 288421 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 288422 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 288422 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 288425 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 288427 ENV polyprotein (coat polyprotein) 288427 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 288428 Proteasome A-type and B-type 288429 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288430 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288432 'chromo' (CHRromatin Organization MOdifier) domain 288434 'chromo' (CHRromatin Organization MOdifier) domain 288435 'chromo' (CHRromatin Organization MOdifier) domain 288436 'chromo' (CHRromatin Organization MOdifier) domain 288437 'chromo' (CHRromatin Organization MOdifier) domain 288438 'chromo' (CHRromatin Organization MOdifier) domain 288439 'chromo' (CHRromatin Organization MOdifier) domain 288442 Translationally controlled tumor protein 288446 Ribosomal L28e protein family 288448 Eukaryotic ribosomal protein L18 288449 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 288452 Cyclophilin type peptidyl-prolyl cis-trans isomerase 288456 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 288456 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 288456 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 288457 Homeobox domain 288458 RNA polymerases L / 13 to 16 kDa subunit 288459 Ubiquitin carboxyl-terminal hydrolase family 2 288460 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 288462 Hsp90 protein 288463 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 288465 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 288469 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 288469 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 288470 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 288471 Phosphatidylinositol 3- and 4-kinase 288471 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 288472 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 288474 ENV polyprotein (coat polyprotein) 288475 Pentaxin family. Pentaxins are also known as pentraxins 288480 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 288481 Fructose-bisphosphate aldolase class-I 288483 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 288485 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 288486 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288489 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288490 7 transmembrane receptor (rhodopsin family) 288491 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 288492 Fibronectin type III domain 288493 Fibronectin type III domain 288494 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288494 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288496 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 288499 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their o 288504 PX domain. PX domains bind to phosphoinositides 288512 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 288515 Kinesin motor domain 288515 LIM domain. This family represents two copies of the LIM structural domain 288515 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 288516 Kinesin motor domain 288517 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 288518 Protein of unknown function (DUF410). Family of conserved eukaryotic proteins with undetermined function 288527 Maf-like protein. Maf is a putative inhibitor of septum formation in eukaryotes, bacteria, and archaea 288528 7 transmembrane receptor (rhodopsin family) 288529 Connexin 288531 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288532 MCM2/3/5 family 288537 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288538 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288539 7 transmembrane receptor (metabotropic glutamate family) 288541 7 transmembrane receptor (metabotropic glutamate family) 288541 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288543 7 transmembrane receptor (metabotropic glutamate family) 288544 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 288550 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 288551 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288556 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 288557 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 288558 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 288560 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 288562 Peptidase family M28D 288562 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 288562 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 288567 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 288569 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 288571 7 transmembrane receptor (metabotropic glutamate family) 288571 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288572 7 transmembrane receptor (metabotropic glutamate family) 288573 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288574 7 transmembrane receptor (metabotropic glutamate family) 288574 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288575 7 transmembrane receptor (metabotropic glutamate family) 288576 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288578 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288579 7 transmembrane receptor (metabotropic glutamate family) 288580 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288581 7 transmembrane receptor (metabotropic glutamate family) 288581 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 288582 PMP-22/EMP/MP20/Claudin family 288583 2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2 288585 ADP-ribosylation factor family 288585 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 288586 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 288588 RNA polymerases L / 13 to 16 kDa subunit 288593 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 288594 Ribosomal protein S15 288595 NOL1/NOP2/sun family 288596 B-box zinc finger 288597 FKBP-type peptidyl-prolyl cis-trans isomerase 288598 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 288598 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 288599 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288602 ENV polyprotein (coat polyprotein) 288605 PMP-22/EMP/MP20/Claudin family 288606 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 288608 Phosphorylase family 2 288613 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 288614 Ribosomal protein S8 288615 Uncharacterized protein family UPF0023 288618 Ribosomal protein S5, C-terminal domain 288619 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 288620 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 288621 Ribosomal protein S17 288622 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 288623 S25 ribosomal protein 288624 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288625 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288632 7 transmembrane receptor (rhodopsin family) 288633 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 288636 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 288637 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 288640 Ribosomal protein L19e 288640 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 288641 Glutaredoxin 288642 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 288646 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 288649 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 288651 Transcription factor Tfb4 288653 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 288655 Phosphatidylinositol transfer protein 288660 Caldesmon 288660 Intermediate filament protein 288660 DNA gyrase/topoisomerase IV, subunit A 288660 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 288660 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 288660 Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element 288660 I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin 288660 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 288660 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 288662 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 288664 PH domain. PH stands for pleckstrin homology 288665 Homeobox domain 288665 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 288668 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 288669 P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function 288674 Protein kinase domain 288675 Disintegrin 288675 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 288675 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 288677 Cyclophilin type peptidyl-prolyl cis-trans isomerase 288682 PH domain. PH stands for pleckstrin homology 288682 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 288692 WD domain, G-beta repeat 288694 Ribosomal S17 288698 Protein kinase domain 288700 ADP-ribosylation factor family 288700 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 288700 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 288701 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288723 TPR Domain 288729 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 288730 CBF/Mak21 family 288731 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 288736 Fork head domain 288738 ATP synthase subunit C 288740 NifU-like N terminal domain. This domain is found in NifU in combination with pfam01106. This domain is found on isolated in several bacterial species. The nif genes are responsible for nitrogen fixation. However this domain is found in bacteria that do n 288741 Fic protein family. This family consists of the Fic (filamentation induced by cAMP) protein and its relatives. The Fic protein is involved in cell division and is suggested to be involved in the synthesis of PAB or folate, indicating that the Fic protein 288742 Sulfotransferase protein 288743 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 288743 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 288746 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 288747 7 transmembrane receptor (metabotropic glutamate family) 288749 START domain 288752 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 288755 P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function 288758 Eukaryotic-type carbonic anhydrase 288764 Mitosis protein DIM1 288764 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 288768 Zinc finger, C3HC4 type (RING finger) 288770 NAC domain 288772 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 288774 SH2 domain 288774 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. Th 288774 STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT 288774 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fac 288777 Ribosomal protein S19 288780 ADP-ribosylation factor family 288780 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 288783 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 288786 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 288787 Tellurite resistance protein TehB 288787 ubiE/COQ5 methyltransferase family 288788 7 transmembrane receptor (rhodopsin family) 288789 7 transmembrane receptor (rhodopsin family) 288790 7 transmembrane receptor (rhodopsin family) 288791 ENV polyprotein (coat polyprotein) 288791 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288792 7 transmembrane receptor (rhodopsin family) 288794 7 transmembrane receptor (rhodopsin family) 288795 7 transmembrane receptor (rhodopsin family) 288796 7 transmembrane receptor (rhodopsin family) 288797 7 transmembrane receptor (rhodopsin family) 288798 7 transmembrane receptor (rhodopsin family) 288799 7 transmembrane receptor (rhodopsin family) 288800 7 transmembrane receptor (rhodopsin family) 288801 7 transmembrane receptor (rhodopsin family) 288802 7 transmembrane receptor (rhodopsin family) 288803 7 transmembrane receptor (rhodopsin family) 288805 7 transmembrane receptor (rhodopsin family) 288806 7 transmembrane receptor (rhodopsin family) 288807 7 transmembrane receptor (rhodopsin family) 288808 7 transmembrane receptor (rhodopsin family) 288808 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288809 7 transmembrane receptor (rhodopsin family) 288810 ENV polyprotein (coat polyprotein) 288810 7 transmembrane receptor (rhodopsin family) 288811 7 transmembrane receptor (rhodopsin family) 288813 7 transmembrane receptor (rhodopsin family) 288814 7 transmembrane receptor (rhodopsin family) 288815 7 transmembrane receptor (rhodopsin family) 288816 7 transmembrane receptor (rhodopsin family) 288817 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 288817 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288818 7 transmembrane receptor (rhodopsin family) 288819 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 288819 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288821 Helix-loop-helix DNA-binding domain 288822 7 transmembrane receptor (rhodopsin family) 288823 7 transmembrane receptor (rhodopsin family) 288824 7 transmembrane receptor (rhodopsin family) 288825 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288825 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288826 7 transmembrane receptor (rhodopsin family) 288827 7 transmembrane receptor (rhodopsin family) 288828 7 transmembrane receptor (rhodopsin family) 288829 7 transmembrane receptor (rhodopsin family) 288830 7 transmembrane receptor (rhodopsin family) 288831 7 transmembrane receptor (rhodopsin family) 288832 7 transmembrane receptor (rhodopsin family) 288833 7 transmembrane receptor (rhodopsin family) 288834 7 transmembrane receptor (rhodopsin family) 288835 7 transmembrane receptor (rhodopsin family) 288836 7 transmembrane receptor (rhodopsin family) 288837 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 288838 7 transmembrane receptor (rhodopsin family) 288839 7 transmembrane receptor (rhodopsin family) 288840 7 transmembrane receptor (rhodopsin family) 288840 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 288840 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 288841 7 transmembrane receptor (rhodopsin family) 288841 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 288843 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 288844 7 transmembrane receptor (rhodopsin family) 288845 7 transmembrane receptor (rhodopsin family) 288846 7 transmembrane receptor (rhodopsin family) 288847 7 transmembrane receptor (rhodopsin family) 288848 7 transmembrane receptor (rhodopsin family) 288850 7 transmembrane receptor (rhodopsin family) 288851 7 transmembrane receptor (rhodopsin family) 288852 7 transmembrane receptor (rhodopsin family) 288853 7 transmembrane receptor (rhodopsin family) 288854 7 transmembrane receptor (rhodopsin family) 288855 7 transmembrane receptor (rhodopsin family) 288855 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 288858 7 transmembrane receptor (rhodopsin family) 288859 7 transmembrane receptor (rhodopsin family) 288861 7 transmembrane receptor (rhodopsin family) 288862 7 transmembrane receptor (rhodopsin family) 288864 7 transmembrane receptor (rhodopsin family) 288865 7 transmembrane receptor (rhodopsin family) 288866 7 transmembrane receptor (rhodopsin family) 288867 7 transmembrane receptor (rhodopsin family) 288868 7 transmembrane receptor (rhodopsin family) 288869 7 transmembrane receptor (rhodopsin family) 288870 7 transmembrane receptor (rhodopsin family) 288871 7 transmembrane receptor (rhodopsin family) 288872 7 transmembrane receptor (rhodopsin family) 288873 7 transmembrane receptor (rhodopsin family) 288873 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 288873 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 288874 7 transmembrane receptor (rhodopsin family) 288875 7 transmembrane receptor (rhodopsin family) 288876 7 transmembrane receptor (rhodopsin family) 288876 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288877 7 transmembrane receptor (rhodopsin family) 288878 7 transmembrane receptor (rhodopsin family) 288879 7 transmembrane receptor (rhodopsin family) 288880 7 transmembrane receptor (rhodopsin family) 288881 7 transmembrane receptor (rhodopsin family) 288882 Ribosomal protein S8e 288883 Ribosomal protein S8e 288884 7 transmembrane receptor (rhodopsin family) 288885 7 transmembrane receptor (rhodopsin family) 288886 7 transmembrane receptor (rhodopsin family) 288887 7 transmembrane receptor (rhodopsin family) 288889 7 transmembrane receptor (rhodopsin family) 288891 7 transmembrane receptor (rhodopsin family) 288892 7 transmembrane receptor (rhodopsin family) 288894 7 transmembrane receptor (rhodopsin family) 288895 7 transmembrane receptor (rhodopsin family) 288897 7 transmembrane receptor (rhodopsin family) 288898 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288898 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288899 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 288906 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 288911 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fact 288912 Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fact 288915 CTF/NF-I family 288917 tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain 288918 Ribonuclease HII 288919 Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell 288922 Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine s 288923 Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine s 288925 Uncharacterised protein family (UPF0139) 288929 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 288930 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 288930 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 288932 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288933 S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 288933 S-adenosylmethionine synthetase, C-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 288933 S-adenosylmethionine synthetase, N-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 288936 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288937 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 288941 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 288946 Peptidase family M49 288948 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 288949 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288949 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288950 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b 288953 Ribosomal S3Ae family 288956 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 288959 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 288963 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288965 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288966 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288967 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288967 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 288969 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 288974 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 288975 ENV polyprotein (coat polyprotein) 288979 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 288980 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 288982 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 288983 Translationally controlled tumor protein 288986 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 288988 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 288989 Ribosomal protein L36e 288990 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 288990 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 288993 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 288995 7 transmembrane receptor (rhodopsin family) 288999 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 289001 Protein kinase domain 289003 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 289005 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289005 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289006 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 289007 Thrombospondin type 1 domain 289010 Sushi domain (SCR repeat) 289010 Sushi domain (SCR repeat) 289013 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 289014 Protein kinase domain 289016 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 289020 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 289021 PX domain. PX domains bind to phosphoinositides 289022 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 289027 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 289032 Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse 289040 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 289041 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 289043 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 289047 LIM domain. This family represents two copies of the LIM structural domain 289048 Homeobox domain 289048 LIM domain. This family represents two copies of the LIM structural domain 289048 LIM domain. This family represents two copies of the LIM structural domain 289049 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 289050 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289050 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289051 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 289051 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 289052 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 289055 Sushi domain (SCR repeat) 289056 Sushi domain (SCR repeat) 289057 Sushi domain (SCR repeat) 289058 Sushi domain (SCR repeat) 289060 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 289064 Ribosomal protein S5, C-terminal domain 289067 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 289068 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 289069 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 289070 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 289071 ENV polyprotein (coat polyprotein) 289074 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 289075 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 289076 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 289077 Ribosomal L29e protein family 289078 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 289080 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 289081 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of var 289085 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 289085 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 289089 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 289090 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 289091 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289097 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 289098 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 289099 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 289100 Ribosomal L15 289103 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 289107 Ribosomal L15 289109 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 289111 Ribosomal protein L23 289114 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289117 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289117 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289118 Ribosomal protein S27 289120 Class I Histocompatibility antigen, domains alpha 1 and 2 289120 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 289121 EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m 289122 Ribosomal S17 289123 Acyl CoA binding protein 289124 Acyl CoA binding protein 289125 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 289128 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 289128 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 289129 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 289134 Enolase, N-terminal domain 289134 Enolase, C-terminal TIM barrel domain 289137 TSC-22/dip/bun family 289138 Zinc-binding dehydrogenase 289143 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 289145 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 289147 7 transmembrane receptor (rhodopsin family) 289151 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 289152 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 289154 GDP dissociation inhibitor 289155 Actin 289156 Amidase 289157 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289157 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289158 Homeobox domain 289160 Synaptobrevin 289165 Ribosomal protein S8e 289166 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 289172 Cyclophilin type peptidyl-prolyl cis-trans isomerase 289173 Adenylate kinase 289175 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analogu 289180 7 transmembrane receptor (rhodopsin family) 289181 WD domain, G-beta repeat 289182 Uncharacterised protein family (UPF0041) 289187 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 289189 'Cold-shock' DNA-binding domain 289190 HMG (high mobility group) box 289191 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 289191 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 289192 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 289192 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 289193 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 289193 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 289195 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 289197 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 289198 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 289199 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 289200 LIM domain. This family represents two copies of the LIM structural domain 289201 Homeobox domain 289201 LIM domain. This family represents two copies of the LIM structural domain 289203 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 289205 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 289208 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 289210 Ribosomal protein S24e 289212 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 289213 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 289217 Succinate dehydrogenase cytochrome b subunit 289218 Respiratory-chain NADH dehydrogenase, 49 Kd subunit 289219 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 289222 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 289222 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 289223 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 289225 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289225 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289234 Death effector domain 289238 7 transmembrane receptor (rhodopsin family) 289239 7 transmembrane receptor (rhodopsin family) 289240 7 transmembrane receptor (rhodopsin family) 289241 7 transmembrane receptor (rhodopsin family) 289242 7 transmembrane receptor (rhodopsin family) 289243 Pentaxin family. Pentaxins are also known as pentraxins 289245 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 289245 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 289246 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 289246 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 289247 7 transmembrane receptor (rhodopsin family) 289248 7 transmembrane receptor (rhodopsin family) 289249 7 transmembrane receptor (rhodopsin family) 289250 7 transmembrane receptor (rhodopsin family) 289251 7 transmembrane receptor (rhodopsin family) 289252 7 transmembrane receptor (rhodopsin family) 289253 7 transmembrane receptor (rhodopsin family) 289254 7 transmembrane receptor (rhodopsin family) 289255 7 transmembrane receptor (rhodopsin family) 289256 7 transmembrane receptor (rhodopsin family) 289257 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 289258 7 transmembrane receptor (rhodopsin family) 289259 7 transmembrane receptor (rhodopsin family) 289260 7 transmembrane receptor (rhodopsin family) 289261 7 transmembrane receptor (rhodopsin family) 289264 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 289264 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerbe 289265 Ribosomal S3Ae family 289266 GDP dissociation inhibitor 289267 7 transmembrane receptor (rhodopsin family) 289269 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 289276 Adenylosuccinate synthetase 289279 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 289282 Elongation factor 1 gamma, conserved domain 289284 Ribosomal protein L14p/L23e 289288 Protein kinase domain 289290 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 289291 Protein kinase domain 289292 7 transmembrane receptor (metabotropic glutamate family) 289292 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289293 7 transmembrane receptor (metabotropic glutamate family) 289294 7 transmembrane receptor (metabotropic glutamate family) 289295 7 transmembrane receptor (metabotropic glutamate family) 289295 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289296 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289296 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289297 Low molecular weight phosphotyrosine protein phosphatase 289298 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289299 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289300 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289301 7 transmembrane receptor (metabotropic glutamate family) 289301 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289302 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 289302 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289302 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289303 7 transmembrane receptor (metabotropic glutamate family) 289303 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289304 7 transmembrane receptor (metabotropic glutamate family) 289305 7 transmembrane receptor (metabotropic glutamate family) 289305 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289311 Homeobox domain 289312 Acyl CoA binding protein 289314 Core histone H2A/H2B/H3/H4 289316 Transforming growth factor beta like domain 289316 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 289317 Delta 1-pyrroline-5-carboxylate reductase 289318 Transforming growth factor beta like domain 289318 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 289319 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 289320 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 289322 Fatty acid desaturase 289323 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 289324 Cornichon protein 289325 WD domain, G-beta repeat 289326 Ribosomal protein L36e 289330 Ribosomal protein S11 289333 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 289335 Sushi domain (SCR repeat) 289336 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 289337 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 289337 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 289339 Ribosomal protein L6 289343 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 289344 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 289349 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289349 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289353 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 289354 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 289355 Ribosomal protein L21e 289357 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 289358 Ribosomal protein L21e 289364 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 289365 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 289367 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 289368 Thrombospondin N-terminal -like domain 289369 Fibronectin type III domain 289370 Fibronectin type III domain 289374 MYND finger 289378 TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme 289380 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 289381 Ribosomal L29e protein family 289384 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 289390 Ribosomal protein S5, C-terminal domain 289391 Ribosomal protein L21e 289392 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 289394 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 289395 Sushi domain (SCR repeat) 289396 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 289397 WD domain, G-beta repeat 289401 Ribosomal protein L31e 289403 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 289404 Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 289405 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 289407 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 289408 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 289411 7 transmembrane receptor (rhodopsin family) 289414 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 289415 Actin 289416 Prolyl oligopeptidase family 289417 Sushi domain (SCR repeat) 289419 Protein kinase domain 289420 Ribosomal L15 289421 Cyclophilin type peptidyl-prolyl cis-trans isomerase 289422 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289424 EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m 289425 Fibronectin type III domain 289426 Ribosomal protein L31e 289427 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 289428 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 289430 MOSC N-terminal beta barrel domain. This domain is found to the N-terminus of pfam03473. The function of this domain is unknown, however it is predicted to adopt a beta barrel fold 289431 7 transmembrane receptor (metabotropic glutamate family) 289432 7 transmembrane receptor (metabotropic glutamate family) 289432 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289432 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 289433 7 transmembrane receptor (metabotropic glutamate family) 289433 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289434 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289434 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289435 7 transmembrane receptor (metabotropic glutamate family) 289436 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 289438 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 289439 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 289439 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 289440 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 289445 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 289446 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 289448 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 289448 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 289450 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 289451 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 289452 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 289456 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 289457 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 289459 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa 289460 Protein-tyrosine phosphatase 289460 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 289462 PH domain. PH stands for pleckstrin homology 289469 Ribosomal protein S18 289473 NAC domain 289475 Ribosomal protein S2 289481 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 289481 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 289482 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 289485 Protein kinase domain 289488 Giardia variant-specific surface protein 289490 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 289491 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 289498 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 289501 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 289503 'Cold-shock' DNA-binding domain 289505 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 289505 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 289509 Ribosomal protein L21e 289510 Ribosomal protein S6e 289516 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 289517 Enolase, N-terminal domain 289517 Enolase, C-terminal TIM barrel domain 289518 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 289522 60Kd inner membrane protein 289531 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 289532 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289533 UDP-glucoronosyl and UDP-glucosyl transferase 289533 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 289534 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 289534 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 289535 UDP-glucoronosyl and UDP-glucosyl transferase 289535 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 289536 UDP-glucoronosyl and UDP-glucosyl transferase 289536 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 289537 UDP-glucoronosyl and UDP-glucosyl transferase 289537 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 289538 UDP-glucoronosyl and UDP-glucosyl transferase 289538 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 289539 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 289540 UDP-glucoronosyl and UDP-glucosyl transferase 289541 UDP-glucoronosyl and UDP-glucosyl transferase 289543 UDP-glucoronosyl and UDP-glucosyl transferase 289543 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 289544 Trypsin 289545 ATP synthase subunit C 289546 Trypsin 289547 Trypsin 289547 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 289549 Protein kinase domain 289551 Fibronectin type III domain 289557 Ribosomal protein L36e 289560 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 289561 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase comp 289565 ADP-ribosylation factor family 289565 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 289569 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 289570 Phosducin 289571 Uncharacterized protein family UPF0016. This family contains integral membrane proteins of unknown function. Most members of the family contain two copies of a region that contains an EXGD motif. Each of these regions contains three predicted transmembran 289574 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 289575 Protein kinase domain 289581 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 289588 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 289596 Low-density lipoprotein receptor domain class A 289596 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 289596 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 289596 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 289600 Trypsin 289606 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 289606 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 289607 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 289608 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 289610 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 289611 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 289612 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 289612 Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains. 289614 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 289618 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289619 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289619 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289623 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 289623 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 289625 Glycosyl hydrolase family 1 289627 Glycosyl hydrolase family 1 289628 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 289630 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 289638 Cyclophilin type peptidyl-prolyl cis-trans isomerase 289639 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289639 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289641 WD domain, G-beta repeat 289642 7 transmembrane receptor (metabotropic glutamate family) 289643 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 289647 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 289648 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 289651 C-5 cytosine-specific DNA methylase 289654 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 289656 Ribosomal protein L23 289662 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 289663 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 289664 LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located in 289665 LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located in 289666 LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located in 289668 Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase 289671 Intermediate filament protein 289671 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 289672 Hsp90 protein 289673 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289676 Ribosomal protein L13e 289677 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 289679 Ribosomal protein S6e 289680 Ribosomal S3Ae family 289686 Galactoside-binding lectin 289687 Glycosyl hydrolase family 1 289691 Ribosomal L10 289692 Ribosomal protein L36e 289693 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 289699 Eukaryotic initiation factor 4E 289701 Ribosomal protein S17 289706 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 289710 ENV polyprotein (coat polyprotein) 289715 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 289720 Saposin A-type domain 289721 LIM domain. This family represents two copies of the LIM structural domain 289722 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 289724 Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production 289727 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 289734 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 289738 PH domain. PH stands for pleckstrin homology 289741 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 289743 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 289743 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 289744 Ribosomal protein L6 289747 LIF / OSM family 289749 Intermediate filament protein 289749 Transposase. Transposase proteins are necessary for efficient DNA transposition. Contains transposases for IS204, IS1001, IS1096 and IS1165 289750 Growth-Arrest-Specific Protein 2 Domain 289751 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 289753 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 289755 Eukaryotic ribosomal protein L18 289756 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 289757 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 289757 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 289761 P53 289764 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289764 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289767 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289767 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289770 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 289771 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 289777 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 289779 Ribosomal protein L13 289784 Core histone H2A/H2B/H3/H4 289786 NAC domain 289793 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 289794 Protein-tyrosine phosphatase 289795 Ribosomal protein L36e 289797 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 289798 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289798 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289799 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289799 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289801 Phosphorylase family. Members of this family include: purine nucleoside phosphorylase (PNP) Uridine phosphorylase (UdRPase) 5'-methylthioadenosine phosphorylase (MTA phosphorylase) 289804 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 289806 'Cold-shock' DNA-binding domain 289808 Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII 289814 Ribosomal protein L21e 289818 Ribosomal protein S5, C-terminal domain 289818 Ribosomal protein S5, N-terminal domain 289821 Ribosomal protein S27 289824 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 289825 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 289827 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid 289828 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289829 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 289830 Ribosomal L28e protein family 289835 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 289843 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289843 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289844 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289844 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289848 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 289849 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 289851 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 289852 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 289853 Ribosomal protein S26e 289858 S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure 289862 Ribosomal protein L19e 289867 WD domain, G-beta repeat 289869 Acylphosphatase 289871 Macrophage migration inhibitory factor (MIF) 289872 Ribosomal protein L21e 289873 7 transmembrane receptor (rhodopsin family) 289877 Ku70/Ku80 C-terminal arm. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the C terminal arm. Thi 289878 3'5'-cyclic nucleotide phosphodiesterase 289879 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 289879 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 289881 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 289883 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 289884 Ribosomal S3Ae family 289887 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 289888 UDP-glucoronosyl and UDP-glucosyl transferase 289889 7 transmembrane receptor (metabotropic glutamate family) 289890 7 transmembrane receptor (metabotropic glutamate family) 289890 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289891 7 transmembrane receptor (metabotropic glutamate family) 289891 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 289891 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 289897 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 289897 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 289899 7 transmembrane receptor (rhodopsin family) 289902 Ribosomal protein S16 289910 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 289914 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoform 289915 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 289919 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 289920 ENV polyprotein (coat polyprotein) 289921 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 289923 7 transmembrane receptor (rhodopsin family) 289924 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 289927 Cadherin domain 289929 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 289930 Translationally controlled tumor protein 289932 Ribosomal protein S7e 289933 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 289934 Beta-ketoacyl synthase, C-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains 289936 HMG (high mobility group) box 289939 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 289943 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 289944 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 289950 Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site 289950 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 289951 Filamin/ABP280 repeat 289957 7 transmembrane receptor (rhodopsin family) 289961 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 289961 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 289965 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 289968 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 289971 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 289973 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 289974 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 289975 Glutaredoxin 289976 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 289988 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 289989 Core histone H2A/H2B/H3/H4 289990 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 289990 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 289993 Protein-tyrosine phosphatase 289993 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 289994 Cornichon protein 289996 Translationally controlled tumor protein 290000 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 290001 7 transmembrane receptor (rhodopsin family) 290002 7 transmembrane receptor (rhodopsin family) 290003 7 transmembrane receptor (rhodopsin family) 290004 7 transmembrane receptor (rhodopsin family) 290005 7 transmembrane receptor (rhodopsin family) 290006 7 transmembrane receptor (rhodopsin family) 290007 7 transmembrane receptor (rhodopsin family) 290008 7 transmembrane receptor (rhodopsin family) 290009 7 transmembrane receptor (rhodopsin family) 290010 7 transmembrane receptor (rhodopsin family) 290011 7 transmembrane receptor (rhodopsin family) 290012 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 290013 7 transmembrane receptor (rhodopsin family) 290014 7 transmembrane receptor (rhodopsin family) 290015 7 transmembrane receptor (rhodopsin family) 290017 7 transmembrane receptor (rhodopsin family) 290018 7 transmembrane receptor (rhodopsin family) 290018 Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-phos 290019 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 290020 7 transmembrane receptor (rhodopsin family) 290021 7 transmembrane receptor (rhodopsin family) 290022 7 transmembrane receptor (rhodopsin family) 290023 7 transmembrane receptor (rhodopsin family) 290024 7 transmembrane receptor (rhodopsin family) 290025 7 transmembrane receptor (rhodopsin family) 290026 Ribosomal protein L35Ae 290027 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 290028 Glycoprotease family 290029 Phosphorylase family 2 290030 S-adenosyl-L-homocysteine hydrolase 290031 Proteasome A-type and B-type 290033 Core histone H2A/H2B/H3/H4 290035 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 290036 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 290039 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 290040 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 290043 Ribosomal protein L31e 290044 7 transmembrane receptor (rhodopsin family) 290045 Ribosomal protein L19e 290046 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 290049 7 transmembrane receptor (rhodopsin family) 290050 7 transmembrane receptor (rhodopsin family) 290051 7 transmembrane receptor (rhodopsin family) 290055 7 transmembrane receptor (rhodopsin family) 290056 7 transmembrane receptor (rhodopsin family) 290057 Enolase, N-terminal domain 290057 Enolase, C-terminal TIM barrel domain 290058 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290059 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290061 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290062 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290064 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290066 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290067 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290069 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290070 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290071 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290072 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290073 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290074 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290075 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290076 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290077 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290078 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290079 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290080 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290081 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290084 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290085 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290086 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290087 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290088 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290089 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290090 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290091 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290092 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290093 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290094 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290095 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290097 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290098 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290099 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290101 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290104 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290105 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290106 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290107 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290108 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290110 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290111 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290112 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290113 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290114 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290115 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290117 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290118 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290119 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290120 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290121 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290122 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290123 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290124 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290125 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290130 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290131 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290132 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290133 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290134 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290135 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290136 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290137 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290138 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290139 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290140 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290141 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290145 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290147 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290148 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290149 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290150 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290151 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290152 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290154 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290155 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290156 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290157 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290161 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290162 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290163 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290164 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290165 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290166 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290167 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290168 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290169 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290170 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290172 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290173 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290174 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290175 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290176 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290178 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290179 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290181 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290182 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290185 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290186 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290187 Hsp90 protein 290188 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290190 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290191 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290192 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290196 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290197 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290199 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290202 7 transmembrane receptor (rhodopsin family) 290203 Ribosomal protein L34e 290204 60Kd inner membrane protein 290206 Proteasome A-type and B-type 290207 Cadherin domain 290207 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 290210 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 290211 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 290215 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 290218 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 290219 Ribosomal L29e protein family 290221 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 290222 Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate 290224 Uncharacterised protein family (UPF0172) 290226 Ribosomal protein S19 290229 Protein kinase domain 290231 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 290233 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 290239 Ribosomal protein L21e 290240 Trypsin 290241 Trypsin 290242 Trypsin 290243 Trypsin 290244 Trypsin 290245 Trypsin 290247 Trypsin 290248 Trypsin 290249 Trypsin 290250 Trypsin 290251 Trypsin 290252 Trypsin 290253 Trypsin 290255 Trypsin 290256 Trypsin 290257 Trypsin 290259 Trypsin 290260 Trypsin 290261 Trypsin 290262 Trypsin 290263 Trypsin 290264 Trypsin 290265 Trypsin 290266 Trypsin 290267 Trypsin 290271 Ribosomal L39 protein 290272 KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG 290276 Ribosomal protein L23 290277 3-hydroxyacyl-CoA dehydrogenase, C-terminal domain. This family also includes lambda crystallin. Some proteins include two copies of this domain 290281 Ribosomal protein L35Ae 290283 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 290288 ENV polyprotein (coat polyprotein) 290291 Mo25 protein family 290292 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 290295 Ets-domain 290295 E1-E2 ATPase 290298 Cyclophilin type peptidyl-prolyl cis-trans isomerase 290299 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 290301 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 290303 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 290306 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 290310 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290310 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290311 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 290312 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 290317 HMG (high mobility group) box 290319 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 290323 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 290324 Ribosomal S17 290326 Protein kinase domain 290332 Ribosomal protein L6, N-terminal domain 290338 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 290340 'chromo' (CHRromatin Organization MOdifier) domain 290341 WD domain, G-beta repeat 290342 WD domain, G-beta repeat 290344 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 290344 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 290344 Disintegrin 290344 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 290344 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 290345 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 290346 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 290347 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 290348 ENV polyprotein (coat polyprotein) 290350 Lysyl oxidase 290352 Death domain 290354 LIM domain. This family represents two copies of the LIM structural domain 290354 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 290359 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 290360 Importin-beta N-terminal domain 290361 PTB domain (IRS-1 type) 290363 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 290364 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 290365 NUDIX domain 290367 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 290368 ENV polyprotein (coat polyprotein) 290370 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 290376 Ribosomal protein S5, C-terminal domain 290377 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 290378 PH domain. PH stands for pleckstrin homology 290379 Protein kinase domain 290384 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 290386 Hsp90 protein 290387 Zinc-binding dehydrogenase 290388 Ribosomal S17 290390 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in 290392 ENV polyprotein (coat polyprotein) 290400 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 290401 Dienelactone hydrolase family 290401 Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family 290402 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 290406 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 290409 Olfactomedin-like domain 290410 NOL1/NOP2/sun family 290412 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 290412 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 290414 Ribosomal L15 290415 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 290415 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 290422 Eukaryotic porin 290426 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 290426 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 290428 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 290429 Pou domain - N-terminal to homeobox domain 290432 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 290433 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 290434 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 290434 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 290435 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 290438 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290441 Phosphotyrosine interaction domain (PTB/PID) 290442 LIM domain. This family represents two copies of the LIM structural domain 290443 Aminotransferase class I and II 290445 Nucleoside diphosphate kinase 290449 Ribosomal L29e protein family 290452 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290452 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290458 Zinc finger, C3HC4 type (RING finger) 290459 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 290459 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 290461 ENV polyprotein (coat polyprotein) 290463 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290463 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290465 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290465 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290466 Actin 290467 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 290470 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 290471 Ribosomal protein S17 290472 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 290474 Glypican 290476 Glypican 290477 Ribosomal protein L21e 290479 Ribosomal protein S6e 290480 Glypican 290481 Glypican 290483 Glypican 290484 Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372 290485 PMP-22/EMP/MP20/Claudin family 290492 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290492 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290493 7 transmembrane receptor (rhodopsin family) 290496 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 290497 Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe 290498 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 290499 FKBP-type peptidyl-prolyl cis-trans isomerase 290500 Uncharacterised protein family (UPF0131). This family of proteins are uncharacterised, however BtrG is part of a butirosin-biosynthetic gene cluster from Bacillus circulans 290501 TPR Domain 290503 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 290503 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 290507 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 290510 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 290511 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290512 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290513 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290514 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290515 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290516 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290521 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 290522 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 290524 Uncharacterised protein family (UPF0131). This family of proteins are uncharacterised, however BtrG is part of a butirosin-biosynthetic gene cluster from Bacillus circulans 290528 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 290530 Homeobox domain 290530 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 290531 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 290534 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 290537 Phosphotyrosine interaction domain (PTB/PID) 290538 Homeobox domain 290541 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 290542 Ribosomal protein L21e 290544 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 290547 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 290548 Ribosomal protein L21e 290548 Ribosomal protein S7e 290551 GMC oxidoreductase. This family of proteins bind FAD as a cofactor 290556 GTPase of unknown function 290557 HMG (high mobility group) box 290562 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 290566 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 290567 Choline/Carnitine o-acyltransferase 290570 PH domain. PH stands for pleckstrin homology 290574 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 290575 LIM domain. This family represents two copies of the LIM structural domain 290592 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 290594 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 290596 Uncharacterized protein family UPF0005 290597 Acyl CoA binding protein 290597 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 290601 ENV polyprotein (coat polyprotein) 290604 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290605 Ribosomal protein S2 290606 ENV polyprotein (coat polyprotein) 290607 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 290608 CAP protein 290610 Putative snoRNA binding domain. This family consists of various Pre RNA processing ribonucleoproteins. The function of the aligned region is unknown however it may be a common RNA or snoRNA or Nop1p binding domain. Nop5p (Nop58p) from yeast is the protein 290611 SH2 domain 290613 Myosin head (motor domain) 290613 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 290616 Calreticulin family 290620 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 290623 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 290624 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 290625 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 290632 Ribosomal protein L34 290633 GTPase of unknown function 290634 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290636 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 290637 Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of var 290638 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 290641 Ribosomal L18ae protein family 290644 Gamma interferon inducible lysosomal thiol reductase (GILT). This family includes the two characterized human gamma-interferon-inducible lysosomal thiol reductase (GILT) sequences. It also contains several other eukaryotic putative proteins with similarit 290645 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 290646 3'5'-cyclic nucleotide phosphodiesterase 290647 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 290648 Pyroglutamyl peptidase 290651 Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play 290656 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 290657 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 290657 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 290658 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 290660 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 290661 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 290662 Mitochondrial carrier protein 290666 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 290670 Protein kinase domain 290674 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 290676 Ribosomal protein L17 290677 Skp1 family, tetramerisation domain 290678 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 290680 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 290681 N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation o 290682 Enolase, C-terminal TIM barrel domain 290682 Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain 290683 Ribosomal protein L35Ae 290685 Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site 290685 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 290685 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 290685 D-xylulose 5-phosphate/D-fructose 6-phosphate phosphoketolase. Bacterial enzyme splits fructose-6-P and/or xylulose-5-P with the aid of inorganic phosphate into either acetyl-P and erythrose-4-P and/or acetyl-P and glyeraldehyde-3-P EC:4.1.2.9, EC:4.1.2.2 290685 Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto 290687 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290687 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290690 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 290690 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 290691 Actin 290692 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 290694 CUB domain 290694 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family 290695 CUB domain 290696 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290696 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290697 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 290698 Ribosomal L10 290700 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 290700 Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus 290704 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290707 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290708 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 290711 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 290713 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 290716 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 290716 Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus 290719 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290719 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290720 Phosphatidylethanolamine-binding protein 290721 Enolase, C-terminal TIM barrel domain 290724 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290726 Disintegrin 290726 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 290726 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 290727 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 290728 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 290729 Formamidopyrimidine-DNA glycosylase 290733 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 290734 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 290738 Ribosomal protein S27 290741 Cytidine and deoxycytidylate deaminase zinc-binding region 290744 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 290752 Mitochondrial carrier protein 290756 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290756 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290757 Trypsin 290757 PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some t 290758 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 290760 Ribosomal protein L21e 290761 Fructose-bisphosphate aldolase class-I 290763 Disintegrin 290763 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 290763 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 290766 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 290767 Zinc finger, C3HC4 type (RING finger) 290767 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 290768 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 290768 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 290772 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 290773 Disintegrin 290773 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 290773 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 290774 Disintegrin 290774 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 290774 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 290775 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 290778 WD domain, G-beta repeat 290781 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 290781 MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain sh 290784 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290784 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290787 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 290789 PMP-22/EMP/MP20/Claudin family 290792 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 290794 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 290795 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 290797 Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protein 290799 Ribosomal protein L21e 290802 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 290804 Ribosomal protein S8 290805 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 290805 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 290805 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin 290807 Ribosomal protein S8 290810 Hsp90 protein 290812 Zinc finger, C3HC4 type (RING finger) 290817 Ribosomal protein S7e 290818 Protein kinase domain 290819 Protein kinase domain 290820 Ribosomal protein L13 290825 Ribosomal protein L21e 290826 Calcium-activated BK potassium channel alpha subunit 290827 Ribosomal protein S28e 290829 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 290834 Disintegrin 290834 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 290837 Ribosomal protein S8e 290840 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 290843 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 290844 Homeobox domain 290846 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 290849 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290849 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290851 Vacuolar protein sorting 36. Vps36 is involved in Golgi to endosome trafficking 290852 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 290856 Defensin propeptide 290857 Defensin propeptide 290858 Mammalian defensin 290858 Defensin propeptide 290859 Defensin propeptide 290860 Mammalian defensin 290860 Defensin propeptide 290861 Beta defensin. The beta defensins are antimicrobial peptides implicated in the resistance of epithelial surfaces to microbial colonization 290862 ENV polyprotein (coat polyprotein) 290864 Ubiquitin carboxyl-terminal hydrolase family 2 290866 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 290867 Ubiquitin carboxyl-terminal hydrolase family 2 290868 tRNA synthetases class I (E and Q), anti-codon binding domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase a 290869 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 290869 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 290875 TPR Domain 290878 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 290879 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 290880 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisin 290881 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 290885 CUB domain 290885 Sushi domain (SCR repeat) 290887 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 290888 CUB domain 290888 Sushi domain (SCR repeat) 290890 CUB domain 290890 Sushi domain (SCR repeat) 290891 Sushi domain (SCR repeat) 290892 Sushi domain (SCR repeat) 290894 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 290897 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 290898 tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases 290901 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290901 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290902 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 290902 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 290905 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 290907 ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653 290907 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 290908 Ribosomal protein S12 290909 Mitochondrial carrier protein 290916 Ribosomal protein L23 290917 HMG (high mobility group) box 290918 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 290921 Transforming growth factor beta like domain 290921 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 290922 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 290923 Asparaginase 290926 PBX domain. The PBX domain is a bipartite acidic domain 290928 Ribosomal protein L6 290931 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 290935 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 290936 Ribosomal protein S8e 290936 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 290937 e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit 290937 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 290937 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 290944 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 290944 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 290947 Flavodoxin 290947 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 290947 FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase 290948 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 290949 Ribosomal protein L21e 290950 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 290951 Protein kinase domain 290953 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 290954 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 290954 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 290956 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 290956 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 290956 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 290957 7 transmembrane receptor (rhodopsin family) 290960 Ribosomal protein L15 290962 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 290963 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 290964 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 290965 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 290966 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 290968 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 290969 Papain family cysteine protease 290969 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 290971 Papain family cysteine protease 290977 Papain family cysteine protease 290977 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 290980 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 290981 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 290982 SAM dependent carboxyl methyltransferase. This family of plant methyltransferases contains enzymes that act on a variety of substrates including salicylic acid, jasmonic acid and 7-Methylxanthine. Caffeine is synthesized through sequential three-step meth 290983 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 290983 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 290984 PAP/25A associated domain 290984 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 290985 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 290987 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 290989 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 290990 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 290991 Actin 290994 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-2 290996 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 291000 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 291003 Eukaryotic initiation factor 4E 291004 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 291006 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 291007 Cyclophilin type peptidyl-prolyl cis-trans isomerase 291009 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291014 Protein kinase domain 291016 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 291017 Ribosomal protein L6e 291023 Helix-loop-helix DNA-binding domain 291028 Ribosomal protein L21e 291029 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 291030 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 291031 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 291035 Ribosomal L29e protein family 291036 Ribosomal protein L36e 291038 Ribosomal protein L21e 291044 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 291047 GCM motif protein 291050 Ribosomal protein L6 291051 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 291053 Ribosomal protein L13 291055 Microtubule associated protein (MAP65/ASE1 family) 291057 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 291059 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291059 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291062 Translocon-associated protein (TRAP), alpha subunit. The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across 291063 Triosephosphate isomerase 291064 Ribosomal protein L6 291064 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 291065 Enolase, C-terminal TIM barrel domain 291067 tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain 291069 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 291073 Ribosomal protein L19e 291075 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 291076 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 291078 Protein kinase domain 291079 HMG (high mobility group) box 291082 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 291083 Ribosomal protein L13e 291084 Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones 291085 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 291086 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 291087 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 291089 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 291090 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 291091 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 291094 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 291095 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 291097 Fork head domain 291100 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt 291103 Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines 291103 pfam02921, UCR_TM, Ubiquinol cytochrome reductase transmembrane region. Each subunit of the cytochrome bc1 complex provides a single helix (this family) to make up the transmembrane region of the complex 291104 Ribosomal protein L21e 291106 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 291113 Core histone H2A/H2B/H3/H4 291114 Somatotropin hormone family 291115 Somatotropin hormone family 291116 Somatotropin hormone family 291117 Somatotropin hormone family 291119 Somatotropin hormone family 291121 Somatotropin hormone family 291122 Ribosomal protein L19e 291122 Somatotropin hormone family 291123 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 291124 Somatotropin hormone family 291126 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 291126 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 291130 Doublecortin 291131 Doublecortin 291132 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 291133 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 291134 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 291135 Uncharacterized protein PaaI, COG2050 291138 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 291138 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 291141 Core histone H2A/H2B/H3/H4 291142 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 291143 Core histone H2A/H2B/H3/H4 291144 Core histone H2A/H2B/H3/H4 291145 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 291146 Zinc finger, C3HC4 type (RING finger) 291146 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 291147 Protein kinase domain 291147 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 291148 Core histone H2A/H2B/H3/H4 291148 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 291149 Core histone H2A/H2B/H3/H4 291152 Core histone H2A/H2B/H3/H4 291153 Core histone H2A/H2B/H3/H4 291153 Uracil DNA glycosylase superfamily 291154 Core histone H2A/H2B/H3/H4 291154 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 291155 Core histone H2A/H2B/H3/H4 291156 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 291157 Core histone H2A/H2B/H3/H4 291158 Core histone H2A/H2B/H3/H4 291158 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 291159 Core histone H2A/H2B/H3/H4 291160 Core histone H2A/H2B/H3/H4 291161 7 transmembrane receptor (rhodopsin family) 291162 7 transmembrane receptor (rhodopsin family) 291167 HMG (high mobility group) box 291169 Glutathione peroxidase 291171 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 291172 7 transmembrane receptor (rhodopsin family) 291173 7 transmembrane receptor (rhodopsin family) 291175 7 transmembrane receptor (rhodopsin family) 291179 7 transmembrane receptor (rhodopsin family) 291179 7 transmembrane receptor (rhodopsin family) 291180 Ependymin 291183 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 291188 Homeobox domain 291189 CAIB/BAIF family. This is a family of enzymes with diverse function, including fatty-acid CoA racemase enzymes such as arylpropionyl-CoA epimerase a key enzyme in the inversion metabolism of ibuprofen, carnitine dehydratase (CAIB) (EC 4.2.1.89) and bile a 291190 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291190 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291192 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291193 Macrophage migration inhibitory factor (MIF) 291198 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 291198 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 291200 Cyclophilin type peptidyl-prolyl cis-trans isomerase 291201 CBF/Mak21 family 291204 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 291205 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 291208 C2 domain 291211 Polyprenyl synthetase 291217 Ribosomal protein S24e 291220 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 291221 ribosomal L5P family C-terminus. This region is found associated with pfam00281 291222 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 291222 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 291223 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 291224 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 291224 Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus 291226 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 291227 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 291229 Ribosomal protein S11 291233 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 291236 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291238 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 291239 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 291240 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 291244 Ribosomal protein S7e 291248 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 291249 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 291252 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 291256 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 291258 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 291260 ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be e 291263 NUDIX domain 291266 NUDIX domain 291268 Ribosomal protein S5, N-terminal domain 291269 NUDIX domain 291270 NUDIX domain 291271 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 291272 NUDIX domain 291273 Elongation factor 1 gamma, conserved domain 291273 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 291275 NUDIX domain 291276 NUDIX domain 291277 Elongation factor 1 gamma, conserved domain 291280 MCM2/3/5 family 291281 Ribosomal protein L21e 291283 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 291284 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 291285 Eukaryotic porin 291287 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 291287 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 291291 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 291292 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 291293 UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t 291296 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 291296 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 291297 Ribosomal protein S6e 291299 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 291301 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 291306 Actin 291307 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 291308 Ribosomal protein L21e 291314 AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt 291314 AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d 291315 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 291315 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 291317 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 291318 Myristoyl-CoA:protein N-myristoyltransferase, C-terminal domain. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold 291324 C-5 cytosine-specific DNA methylase 291325 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 291327 Lectin C-type domain. This family includes both long and short form C-type 291328 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 291329 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291329 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291330 Ribosomal protein S2 291332 Low-density lipoprotein receptor domain class A 291332 MAM domain. An extracellular domain found in many receptors 291333 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 291334 MAM domain. An extracellular domain found in many receptors 291335 MAM domain. An extracellular domain found in many receptors 291336 MAM domain. An extracellular domain found in many receptors 291340 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 291344 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 291345 Protein kinase domain 291346 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 291347 Ribosomal protein L23 291349 Protein kinase domain 291350 Protein kinase domain 291350 Myosin head (motor domain) 291352 7 transmembrane receptor (metabotropic glutamate family) 291352 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 291353 Ribosomal protein S8e 291354 Ribosomal protein L23 291355 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 291358 Cyclophilin type peptidyl-prolyl cis-trans isomerase 291359 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 291359 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 291360 Somatotropin hormone family 291361 7 transmembrane receptor (rhodopsin family) 291367 Ribosomal protein L21e 291369 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 291372 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 291375 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 291376 PTB domain (IRS-1 type) 291379 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 291379 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 291382 Replication factor-A protein 1, N-terminal domain 291382 OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata 291386 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 291388 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 291394 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 291403 Zinc-binding dehydrogenase 291407 Ribosomal protein L6 291413 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 291414 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 291414 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 291415 Mitosis protein DIM1 291415 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 291416 Ribosomal protein L10 291417 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291417 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291426 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 291427 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 291428 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 291432 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 291434 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 291435 Mitochondrial carrier protein 291436 Lipase 291437 Lipase 291437 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 291442 Cadherin domain 291442 TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin 291443 Ribosomal L10 291444 Protein of unknown function DUF84. The function of this prokaryotic protein family is unknown 291449 Ribosomal protein L35Ae 291450 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 291451 Eukaryotic porin 291452 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 291454 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 291455 metallopeptidase family M24 291456 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291456 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291457 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291458 HMG (high mobility group) box 291461 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 291461 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 291463 Cyclophilin type peptidyl-prolyl cis-trans isomerase 291465 PX domain. PX domains bind to phosphoinositides 291465 Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2 291466 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 291468 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 291468 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 291470 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 291472 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291474 Ribosomal protein L19e 291474 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291477 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291478 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291480 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291483 Ribosomal protein L19e 291483 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291484 Ribosomal protein L19e 291485 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291486 7 transmembrane receptor (rhodopsin family) 291487 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291488 Ribosomal protein L19e 291489 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291490 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291491 Ribosomal protein L19e 291494 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291495 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291496 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291499 Ribosomal protein L19e 291501 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291503 Glutaredoxin 291503 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 291504 Ribosomal protein L19e 291505 Ribosomal protein L19e 291506 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291507 7 transmembrane receptor (rhodopsin family) 291508 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291509 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291510 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291511 Ribosomal protein L19e 291514 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291516 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291519 Ribosomal protein L19e 291520 Ribosomal protein L19e 291521 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291522 7 transmembrane receptor (rhodopsin family) 291523 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291523 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291524 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291524 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291526 impB/mucB/samB family. These proteins are involved in UV protection 291527 START domain 291532 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291532 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291536 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291536 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291537 Ribosomal L29e protein family 291538 Thymosin beta-4 family 291541 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 291542 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291542 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291543 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291543 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291545 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291545 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291547 Eukaryotic ribosomal protein L18 291548 Ribosomal protein S28e 291551 S25 ribosomal protein 291553 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 291556 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 291559 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 291560 Ribosomal protein S7e 291561 Ribosomal protein L31e 291562 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 291564 PH domain. PH stands for pleckstrin homology 291568 Protein kinase domain 291568 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 291569 Homeobox domain 291581 Ribosomal protein L21e 291582 AMP-binding enzyme 291585 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 291586 DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after 291588 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291589 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291590 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 291591 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 291593 Ribosomal protein S2 291594 Ribosomal protein S2 291597 B-box zinc finger 291597 Fibronectin type III domain 291598 Ribosomal protein L6 291599 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 291603 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291603 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291604 NUDIX domain 291605 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 291606 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 291607 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 291611 Ribosomal protein S5, C-terminal domain 291613 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 291615 Poly-adenylate binding protein, unique domain 291615 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 291617 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291617 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291618 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 291619 Ribosomal protein L36e 291621 Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site 291621 Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases 291621 Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto 291622 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 291623 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291623 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 291626 Ribosomal protein S5, C-terminal domain 291635 Cadherin domain 291636 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 291636 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 291637 Cadherin domain 291638 Cadherin domain 291639 Cadherin domain 291640 Mitochondrial carrier protein 291641 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 291642 Cadherin domain 291643 Cadherin domain 291644 Cadherin domain 291645 Cadherin domain 291646 Cadherin domain 291647 Cadherin domain 291648 Cadherin domain 291648 Ribosomal protein L15 amino terminal region. This family is always associated with pfam00256. This family is diagnostic of ribosomal L15 proteins 291649 Cadherin domain 291650 Fructose-bisphosphate aldolase class-I 291651 Cadherin domain 291652 Cadherin domain 291653 Cadherin domain 291654 Cadherin domain 291655 Cadherin domain 291656 Cadherin domain 291666 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 291668 Neuregulin family 291669 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 291670 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 291671 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 291678 wnt family 291679 Nucleoside diphosphate kinase 291680 Nucleoside diphosphate kinase 291681 wnt family 291682 wnt family 291685 SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. The 291687 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 291688 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 291690 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 291691 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 291692 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 291696 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 291697 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 291698 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 291699 START domain 291701 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 291701 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 291704 Protein kinase domain 291706 Actin 291707 LIM domain. This family represents two copies of the LIM structural domain 291709 Mitochondrial carrier protein 291711 Actin 291712 Eukaryotic ribosomal protein L18 291713 ADP-ribosylation factor family 291713 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 291715 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 291715 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 291717 Chaperonin 10 Kd subunit 291718 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291719 Mitochondrial carrier protein 291721 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 291728 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291729 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 291729 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 291733 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 291734 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 291738 Ribosomal protein S2 291740 Ribosomal L10 291740 Ribosomal protein L16 291741 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 291742 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 291745 Cyclophilin type peptidyl-prolyl cis-trans isomerase 291749 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 291752 Cadherin domain 291753 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 291753 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 291754 Cadherin domain 291754 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 291755 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 291757 Ribosomal protein L3 291758 Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase 291759 Cadherin domain 291760 Cadherin domain 291761 Elongation factor 1 gamma, conserved domain 291761 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 291761 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 291762 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 291764 7 transmembrane receptor (metabotropic glutamate family) 291766 Cyclophilin type peptidyl-prolyl cis-trans isomerase 291767 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291768 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 291770 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 291771 Fibrillarin 291772 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 291774 Proteasome A-type and B-type 291776 Proteasome A-type and B-type 291782 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 291786 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291786 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291791 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 291794 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 291795 RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits 291796 Ubiquitin carboxyl-terminal hydrolase family 2 291797 Death domain 291798 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 291799 ADP-ribosylation factor family 291800 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 291800 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 291800 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 291801 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 291802 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 291805 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 291807 Ribosomal protein L19e 291808 Ribosomal protein L19e 291814 Carboxylesterase 291817 Carboxylesterase 291818 Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenyla 291819 Eukaryotic-type carbonic anhydrase 291822 Microtubule associated protein (MAP65/ASE1 family) 291823 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 291824 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 291826 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 291827 NAC domain 291828 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 291837 'Cold-shock' DNA-binding domain 291838 RNA polymerases N / 8 kDa subunit 291840 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 291846 Trypsin 291854 7 transmembrane receptor (Secretin family) 291854 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 291860 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 291862 Carboxylesterase 291863 Carboxylesterase 291864 Carboxylesterase 291871 Hsp90 protein 291871 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 291872 Hsp90 protein 291875 LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 291879 Ribosomal protein L13e 291881 Protein of unknown function, DUF259 291881 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 291884 Ribosomal protein S5, C-terminal domain 291884 Ribosomal protein S5, N-terminal domain 291885 MCM2/3/5 family 291886 Core histone H2A/H2B/H3/H4 291889 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 291890 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 291894 Myosin head (motor domain) 291902 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 291906 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 291909 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291911 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 291919 'Cold-shock' DNA-binding domain 291920 Ribosomal L29e protein family 291923 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 291923 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 291923 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 291926 Protein kinase domain 291929 7 transmembrane receptor (rhodopsin family) 291930 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 291931 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 291933 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 291933 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 291935 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 291937 Anaphase-promoting complex, subunit 10 (APC10) 291938 Possible metal-binding domain in RNase L inhibitor, RLI. Possible metal-binding domain in endoribonuclease RNase L inhibitor. Found at the N-terminal end of RNase L inhibitor proteins, adjacent to the 4Fe-4S binding domain, fer4, pfam00037. Also often fou 291939 ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems 291941 Ribosomal L29e protein family 291943 Ribosomal protein L21e 291946 Domain of unknown function 291948 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 291949 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 291950 Core histone H2A/H2B/H3/H4 291951 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 291952 Carboxylesterase 291953 Carboxylesterase 291954 Carboxylesterase 291955 Carboxylesterase 291957 Ribosomal protein S8 291958 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 291965 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 291969 ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae 291972 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 291980 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 291981 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 291983 Proteasome A-type and B-type 291984 Renal dipeptidase 291985 Cadherin domain 291986 Cadherin domain 291986 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 291988 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 291989 Polypeptide deformylase 291991 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 291993 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 291994 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 291997 Ribosomal protein L11, RNA binding domain 291999 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 292002 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 292006 ENV polyprotein (coat polyprotein) 292008 Mitosis protein DIM1 292008 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 292013 Ribosomal protein S24e 292015 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 292016 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple 292017 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 292018 Ribosomal L29e protein family 292019 CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pr 292022 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 292022 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 292023 DHHA1 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA1 for DHH associated domain. This domain is diagnostic of DHH subfamily 1 members. This domains is also found in alanyl tRNA synthetase, suggesting that this d 292025 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 292026 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292026 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 292028 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 292030 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 292031 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 292033 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 292036 Skp1 family, dimerisation domain 292036 Skp1 family, tetramerisation domain 292037 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 292038 NUDIX domain 292039 Zinc-binding dehydrogenase 292041 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 292044 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 292045 'chromo' (CHRromatin Organization MOdifier) domain 292046 Ribosomal protein L21e 292048 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 292048 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 292048 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six co 292049 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 292054 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 292057 Adenosine-deaminase (editase) domain 292057 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 292058 Domain of unknown function (DUF392). A eukaryotic specific domain of undetermined function.` 292060 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt 292072 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 292073 Sulfatase 292073 Protein of unknown function (DUF229). Members of this family are uncharacterised. They are 500-1200 amino acids in length and share a long region conservation that probably corresponds to several domains 292074 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 292077 Sulfotransferase protein 292079 Protein kinase domain 292079 Uncharacterized protein family (ORF7) DUF. Several members of this family are Borrelia burgdorferi plasmid proteins of uncharacterized function 292083 7 transmembrane receptor (rhodopsin family) 292084 Ribosomal protein L35Ae 292085 Nup133 nucleoporin. RNA undergoing nuclear export first encounters the basket of the nuclear pore. Nup133 is a nucleoporin accessible on the basket side of the pore 292089 PMP-22/EMP/MP20/Claudin family 292096 Ribosomal L29e protein family 292097 Arv1-like family. Arv1 is a transmembrane protein with potential zinc-binding motifs. ARV1 is a novel mediator of eukaryotic sterol homeostasis 292104 Ribosomal L29e protein family 292106 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 292107 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 292109 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 292115 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 292117 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 292119 Ribosomal protein L11, RNA binding domain 292123 S25 ribosomal protein 292124 Zona pellucida-like domain 292126 Trypsin 292126 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 292129 MYND finger 292129 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 292130 PH domain. PH stands for pleckstrin homology 292133 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 292134 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292135 Ribosomal protein S19e 292137 Lipase 292138 Enolase, C-terminal TIM barrel domain 292141 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 292143 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 292147 Ribosomal protein L35Ae 292149 Tricarboxylate carrier 292150 Tricarboxylate carrier 292151 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 292152 Carboxylesterase 292153 Carboxylesterase 292154 Carboxylesterase 292156 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 292170 IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing 292172 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 292173 Transcription factor TFIID (or TATA-binding protein, TBP) 292174 Ribosomal protein S5, C-terminal domain 292176 Peptidase C16 family 292176 Ribosomal protein L21e 292177 Ribosomal L29e protein family 292178 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 292180 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 292181 Sugar (and other) transporter 292182 pfam02892, zf-BED, BED zinc finger 292186 ENV polyprotein (coat polyprotein) 292192 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292193 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 292195 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 292196 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 292201 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 292203 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292206 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 292211 Ribosomal protein L24e 292212 Ribosomal protein L24e 292213 Ribosomal protein L24e 292214 Protein kinase domain 292215 Protein kinase domain 292217 Protein kinase domain 292218 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 292219 Ribosomal protein L24e 292220 Protein kinase domain 292221 Protein kinase domain 292222 Protein kinase domain 292223 Protein kinase domain 292224 Protein kinase domain 292225 Protein kinase domain 292226 Protein kinase domain 292227 Protein kinase domain 292231 7 transmembrane receptor (rhodopsin family) 292231 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292232 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292233 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292234 Homeobox domain 292236 C2 domain 292238 Fibronectin type III domain 292241 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 292241 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 292242 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 292243 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 292244 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 292248 7 transmembrane receptor (metabotropic glutamate family) 292249 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292251 Sulfotransferase protein 292252 Phosphoglycerate kinase 292253 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 292255 Protein kinase domain 292256 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 292257 PH domain. PH stands for pleckstrin homology 292258 Ribosomal protein L13 292259 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 292261 Ribosomal protein L6 292262 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons 292271 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 292279 Oxidoreductase FAD-binding domain 292279 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous 292280 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 292284 Protein-tyrosine phosphatase 292284 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 292287 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 292289 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 292289 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 292294 Intermediate filament protein 292295 Poly-adenylate binding protein, unique domain 292295 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292296 Actin 292297 Core histone H2A/H2B/H3/H4 292301 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 292302 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 292304 Ribosomal protein L19e 292306 Ribonuclease T2 family 292308 Protein kinase domain 292310 7 transmembrane receptor (rhodopsin family) 292311 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 292313 Protein kinase domain 292314 Protein kinase domain 292316 Protein kinase domain 292316 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292317 Protein kinase domain 292319 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 292324 7 transmembrane receptor (rhodopsin family) 292325 Calpain family cysteine protease 292325 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 292327 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict 292327 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292330 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 292331 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 292332 Protein kinase domain 292334 Protein kinase domain 292334 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292335 Protein kinase domain 292335 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292336 Protein kinase domain 292336 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292338 Protein kinase domain 292340 Protein kinase domain 292340 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292342 Protein kinase domain 292342 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292343 Protein kinase domain 292346 Protein kinase domain 292346 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292347 Protein kinase domain 292348 Protein kinase domain 292348 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292350 Protein kinase domain 292351 Protein kinase domain 292352 Protein kinase domain 292353 Protein kinase domain 292354 Protein kinase domain 292354 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292355 Protein kinase domain 292357 Protein kinase domain 292358 Protein kinase domain 292358 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292360 Protein kinase domain 292360 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292362 Protein kinase domain 292363 Protein kinase domain 292364 Protein kinase domain 292365 Protein kinase domain 292366 Protein kinase domain 292367 Protein kinase domain 292368 Protein kinase domain 292368 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292371 Protein kinase domain 292371 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292372 Protein kinase domain 292376 Protein kinase domain 292376 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292377 Protein kinase domain 292379 Triosephosphate isomerase 292380 Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence 292383 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 292384 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292385 Ribosomal protein L23 292386 Protein kinase domain 292386 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292387 Protein kinase domain 292388 Protein kinase domain 292388 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292389 Protein kinase domain 292389 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292391 Protein kinase domain 292391 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292393 Protein kinase domain 292394 Protein kinase domain 292394 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292395 Protein kinase domain 292395 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292396 Protein kinase domain 292396 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292397 Hsp90 protein 292399 Protein kinase domain 292404 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 292405 Thrombospondin N-terminal -like domain 292407 Ribosomal L22e protein family 292409 7 transmembrane receptor (rhodopsin family) 292410 7 transmembrane receptor (rhodopsin family) 292411 7 transmembrane receptor (rhodopsin family) 292413 7 transmembrane receptor (rhodopsin family) 292414 7 transmembrane receptor (metabotropic glutamate family) 292414 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292415 7 transmembrane receptor (rhodopsin family) 292416 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292417 7 transmembrane receptor (metabotropic glutamate family) 292417 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292418 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292419 7 transmembrane receptor (rhodopsin family) 292420 7 transmembrane receptor (metabotropic glutamate family) 292420 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292421 7 transmembrane receptor (rhodopsin family) 292421 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292422 ENV polyprotein (coat polyprotein) 292422 7 transmembrane receptor (rhodopsin family) 292422 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292423 Protein kinase domain 292423 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 292425 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292426 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292428 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292430 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292431 7 transmembrane receptor (metabotropic glutamate family) 292432 7 transmembrane receptor (metabotropic glutamate family) 292432 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292433 7 transmembrane receptor (metabotropic glutamate family) 292433 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292436 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292437 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292438 7 transmembrane receptor (metabotropic glutamate family) 292438 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292440 7 transmembrane receptor (metabotropic glutamate family) 292440 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292441 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292442 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292443 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292444 7 transmembrane receptor (metabotropic glutamate family) 292445 7 transmembrane receptor (metabotropic glutamate family) 292445 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292447 7 transmembrane receptor (metabotropic glutamate family) 292447 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292452 7 transmembrane receptor (metabotropic glutamate family) 292464 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 292466 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 292467 Ets-domain 292471 ATP synthase alpha/beta chain, C terminal domain 292471 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 292473 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 292474 Ribosomal protein L15 292476 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 292478 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 292479 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 292482 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 292482 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 292483 Syntaxin 292485 Alpha-L-fucosidase 292488 7 transmembrane receptor (metabotropic glutamate family) 292488 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292489 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292490 7 transmembrane receptor (metabotropic glutamate family) 292493 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 292494 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292495 7 transmembrane receptor (metabotropic glutamate family) 292497 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292498 7 transmembrane receptor (metabotropic glutamate family) 292499 7 transmembrane receptor (metabotropic glutamate family) 292500 7 transmembrane receptor (metabotropic glutamate family) 292500 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 292503 7 transmembrane receptor (metabotropic glutamate family) 292504 7 transmembrane receptor (metabotropic glutamate family) 292505 7 transmembrane receptor (metabotropic glutamate family) 292505 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292506 7 transmembrane receptor (metabotropic glutamate family) 292506 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292508 7 transmembrane receptor (metabotropic glutamate family) 292508 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292509 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 292510 7 transmembrane receptor (metabotropic glutamate family) 292510 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292512 7 transmembrane receptor (metabotropic glutamate family) 292513 7 transmembrane receptor (metabotropic glutamate family) 292518 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 292519 ENV polyprotein (coat polyprotein) 292519 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 292521 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292523 Ribosomal protein S8 292524 Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold 292524 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292526 7 transmembrane receptor (rhodopsin family) 292526 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292528 7 transmembrane receptor (rhodopsin family) 292528 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292529 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292530 7 transmembrane receptor (metabotropic glutamate family) 292530 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292531 7 transmembrane receptor (metabotropic glutamate family) 292536 Putative snoRNA binding domain. This family consists of various Pre RNA processing ribonucleoproteins. The function of the aligned region is unknown however it may be a common RNA or snoRNA or Nop1p binding domain. Nop5p (Nop58p) from yeast is the protein 292539 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 292540 7 transmembrane receptor (rhodopsin family) 292542 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 292543 Amino acid permease 292544 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 292545 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 292546 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone 292547 7 transmembrane receptor (metabotropic glutamate family) 292547 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292548 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292551 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292552 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292553 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 292554 Protein kinase domain 292555 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 292558 7 transmembrane receptor (rhodopsin family) 292559 7 transmembrane receptor (rhodopsin family) 292560 7 transmembrane receptor (rhodopsin family) 292561 7 transmembrane receptor (rhodopsin family) 292562 7 transmembrane receptor (rhodopsin family) 292563 7 transmembrane receptor (rhodopsin family) 292564 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292566 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 292566 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 292567 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 292571 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 292572 Protein kinase domain 292573 Homeobox domain 292574 Homeobox domain 292575 Homeobox domain 292576 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292578 Sulfotransferase protein 292578 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 292579 Sulfotransferase protein 292579 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 292580 7 transmembrane receptor (rhodopsin family) 292580 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292581 ICE-like protease (caspase) p10 domain 292581 ICE-like protease (caspase) p20 domain 292582 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 292583 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 292585 Protein kinase domain 292587 Isochorismatase family. This family are hydrolase enzymes 292588 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 292590 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 292593 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 292600 7 transmembrane receptor (rhodopsin family) 292600 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292601 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 292602 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 292602 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 292603 Protein kinase domain 292604 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292607 7 transmembrane receptor (rhodopsin family) 292607 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292608 7 transmembrane receptor (rhodopsin family) 292608 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292609 7 transmembrane receptor (metabotropic glutamate family) 292610 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292612 7 transmembrane receptor (metabotropic glutamate family) 292612 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292615 7 transmembrane receptor (metabotropic glutamate family) 292615 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292617 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 292620 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292626 Ribosomal protein L23 292628 Homeobox domain 292629 Homeobox domain 292630 Homeobox domain 292631 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292632 Sulfotransferase protein 292633 Sulfotransferase protein 292634 Sulfotransferase protein 292635 Sulfotransferase protein 292637 Sulfotransferase protein 292639 CUB domain 292641 ENV polyprotein (coat polyprotein) 292641 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 292641 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 292642 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292643 Histidine acid phosphatase 292644 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 292645 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 292646 Fibronectin type II domain 292647 Homeobox domain 292648 Sulfotransferase protein 292651 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 292655 Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell 292656 Macrophage migration inhibitory factor (MIF) 292657 Sodium:dicarboxylate symporter family 292657 Sodium:dicarboxylate symporter family 292658 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 292659 Protein kinase domain 292661 7 transmembrane receptor (rhodopsin family) 292662 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 292664 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 292665 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 292667 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 292670 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 292671 Eukaryotic porin 292671 Class I Histocompatibility antigen, domains alpha 1 and 2 292672 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 292672 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 292673 Helix-hairpin-helix motif 292673 DNA repair protein rad10 292677 FKBP-type peptidyl-prolyl cis-trans isomerase 292681 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 292686 F-box domain 292687 Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally imp 292688 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 292693 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 292701 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 292702 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 292703 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292704 HMG (high mobility group) box 292705 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292710 Metallo-beta-lactamase superfamily 292719 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 292720 Ets-domain 292721 Ets-domain 292726 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 292727 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 292728 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 292729 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292740 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292741 'Cold-shock' DNA-binding domain 292742 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292743 Ribosomal protein L23 292744 Low molecular weight phosphotyrosine protein phosphatase 292745 von Willebrand factor type D domain 292746 von Willebrand factor type D domain 292746 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 292747 Fibrillarin 292749 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 292755 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated 292756 Protein kinase domain 292756 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 292757 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated 292758 Ribosomal protein S12 292760 Vacuolar sorting protein 9 (VPS9) domain 292761 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292763 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 292768 Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1 292770 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 292773 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292774 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292775 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292777 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 292778 RNA polymerases M/15 Kd subunit 292782 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 292783 Actin 292784 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 292784 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 292785 Proline dehydrogenase 292786 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 292798 Glycosyltransferase family 6 292799 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292802 Hsp90 protein 292803 Programmed cell death protein 2, C-terminal domain 292804 Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate 292807 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 292808 metallopeptidase family M24 292809 WD domain, G-beta repeat 292812 Ribosomal protein S5, C-terminal domain 292812 Ribosomal protein S5, N-terminal domain 292814 Protein of unknown function DUF122. This protein family has no known function 292815 ENV polyprotein (coat polyprotein) 292818 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 292819 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 292821 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292822 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292825 Ribosomal protein L13 292826 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292827 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292828 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 292828 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 292830 Domain of unknown function UPF0086. This small conserved archaeal protein has no known function 292831 Domain of unknown function UPF0086. This small conserved archaeal protein has no known function 292833 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 292833 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 292836 eRF1 domain 1. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 292836 eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 292836 eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 292837 7 transmembrane receptor (metabotropic glutamate family) 292837 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 292839 7 transmembrane receptor (metabotropic glutamate family) 292840 ENV polyprotein (coat polyprotein) 292841 Core histone H2A/H2B/H3/H4 292841 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 292841 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292842 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292845 Electron transfer flavoprotein beta subunit. This protein is distantly related to and forms a heterodimer with pfam00766 292847 Uncharacterized protein family UPF0021 292848 Trypsin 292849 Trypsin 292851 Trypsin 292852 Trypsin 292853 Trypsin 292854 Translation initiation factor SUI1 292855 Trypsin 292856 Trypsin 292857 Trypsin 292858 Trypsin 292859 Trypsin 292861 Trypsin 292862 Trypsin 292863 Trypsin 292864 Trypsin 292865 Trypsin 292866 Trypsin 292867 Trypsin 292868 Trypsin 292870 Trypsin 292870 ENV polyprotein (coat polyprotein) 292870 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 292870 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 292871 Trypsin 292872 Trypsin 292873 Trypsin 292876 Josephin 292879 Fibronectin type III domain 292879 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 292881 Ets-domain 292882 Ribosomal protein L19e 292884 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 292895 Ribosomal protein L13 292896 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 292902 Sodium:neurotransmitter symporter family 292903 Sodium:neurotransmitter symporter family 292906 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292909 Tub family 292910 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 292912 DIL domain. The DIL domain has no known function 292915 Sulfotransferase protein 292916 Synaptogyrin. This family of proteins is distantly related to pfam01284 292917 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 292918 von Willebrand factor type D domain 292918 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 292919 Helix-loop-helix DNA-binding domain 292919 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates s 292922 Serum amyloid A protein 292924 Serum amyloid A protein 292925 Uncharacterized ACR, COG1579 292926 7 transmembrane receptor (rhodopsin family) 292927 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 292928 7 transmembrane receptor (rhodopsin family) 292929 7 transmembrane receptor (rhodopsin family) 292932 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 292934 Homeobox domain 292936 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 292936 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 292948 Protein kinase domain 292949 PMP-22/EMP/MP20/Claudin family 292955 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 292956 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 292958 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 292962 Enolase, N-terminal domain 292962 Enolase, C-terminal TIM barrel domain 292964 Ribosomal protein L44 292965 Ribosomal protein L15 292968 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle 292969 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle 292977 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 292979 Core histone H2A/H2B/H3/H4 292981 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 292981 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 292983 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 292992 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 292993 WD domain, G-beta repeat 292993 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 292994 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 292998 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 292999 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 293001 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 293002 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 293003 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 293004 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 293006 Ribosomal L15 293009 Cyclophilin type peptidyl-prolyl cis-trans isomerase 293010 Ribosomal protein S6e 293011 Ribosomal protein S5, N-terminal domain 293011 Ribosomal protein S5, C-terminal domain 293012 Helix-loop-helix DNA-binding domain 293014 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 293015 Protein kinase domain 293022 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293022 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293023 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 293023 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 293025 Ribosomal protein L13e 293026 Ribosomal protein S27 293026 Enolase, N-terminal domain 293026 Enolase, C-terminal TIM barrel domain 293027 Core histone H2A/H2B/H3/H4 293028 Eukaryotic DNA topoisomerase I, catalytic core. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and recombina 293028 pfam02919, Topoisomer_I_N, Eukaryotic DNA topoisomerase I, DNA binding fragment. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replicat 293029 Macrophage migration inhibitory factor (MIF) 293031 Ribosomal protein S7e 293032 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 293033 Ribosomal protein L21e 293035 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 293037 7 transmembrane receptor (rhodopsin family) 293039 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 293041 Protein kinase domain 293043 Isocitrate/isopropylmalate dehydrogenase 293044 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 293046 Helix-loop-helix DNA-binding domain 293049 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 293050 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 293051 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 293052 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 293053 Ribosomal protein S11 293059 Uncharacterized ACR, YggU family COG1872 293062 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 293063 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 293064 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 293066 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 293067 A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation 293067 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets in 293068 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 293069 7 transmembrane receptor (metabotropic glutamate family) 293069 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 293070 7 transmembrane receptor (rhodopsin family) 293072 7 transmembrane receptor (rhodopsin family) 293073 7 transmembrane receptor (metabotropic glutamate family) 293073 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 293075 7 transmembrane receptor (rhodopsin family) 293076 7 transmembrane receptor (rhodopsin family) 293077 7 transmembrane receptor (rhodopsin family) 293078 7 transmembrane receptor (rhodopsin family) 293080 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 293081 7 transmembrane receptor (rhodopsin family) 293082 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 293083 7 transmembrane receptor (rhodopsin family) 293084 7 transmembrane receptor (rhodopsin family) 293085 7 transmembrane receptor (rhodopsin family) 293086 7 transmembrane receptor (rhodopsin family) 293087 7 transmembrane receptor (rhodopsin family) 293088 7 transmembrane receptor (rhodopsin family) 293089 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 293090 7 transmembrane receptor (rhodopsin family) 293091 7 transmembrane receptor (rhodopsin family) 293092 7 transmembrane receptor (rhodopsin family) 293093 7 transmembrane receptor (metabotropic glutamate family) 293094 7 transmembrane receptor (rhodopsin family) 293094 7 transmembrane receptor (metabotropic glutamate family) 293095 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 293096 7 transmembrane receptor (metabotropic glutamate family) 293097 Nucleoside diphosphate kinase 293100 Trypsin 293101 Malic enzyme, N-terminal domain 293102 Malic enzyme, NAD binding domain 293106 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 293115 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 293119 Ribosomal protein L3 293120 ENV polyprotein (coat polyprotein) 293121 14-3-3 protein 293125 Ribosomal protein L23 293128 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 293128 OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata 293129 ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum m 293131 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 293132 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 293133 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 293133 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 293133 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 293134 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 293136 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 293140 7 transmembrane receptor (rhodopsin family) 293141 7 transmembrane receptor (rhodopsin family) 293145 Phosphoglucomutase/phosphomannomutase, C-terminal domain 293149 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 293150 START domain 293152 NAD:arginine ADP-ribosyltransferase 293153 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 293153 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 293154 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch 293162 PH domain. PH stands for pleckstrin homology 293165 HMG (high mobility group) box 293168 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 293169 Ribosomal protein S11 293171 metallopeptidase family M24 293173 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 293174 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 293176 Enolase, N-terminal domain 293177 Enolase, N-terminal domain 293177 Enolase, C-terminal TIM barrel domain 293179 TEA/ATTS domain family 293182 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 293186 Extracellular link domain 293187 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 293188 7 transmembrane receptor (rhodopsin family) 293189 7 transmembrane receptor (rhodopsin family) 293190 Bcl-2 homology region 4 293190 Apoptosis regulator proteins, Bcl-2 family 293191 7 transmembrane receptor (rhodopsin family) 293192 7 transmembrane receptor (rhodopsin family) 293193 7 transmembrane receptor (rhodopsin family) 293194 7 transmembrane receptor (rhodopsin family) 293195 7 transmembrane receptor (rhodopsin family) 293196 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293196 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293197 7 transmembrane receptor (rhodopsin family) 293198 7 transmembrane receptor (rhodopsin family) 293199 7 transmembrane receptor (rhodopsin family) 293201 7 transmembrane receptor (rhodopsin family) 293202 7 transmembrane receptor (rhodopsin family) 293203 7 transmembrane receptor (rhodopsin family) 293204 7 transmembrane receptor (rhodopsin family) 293205 7 transmembrane receptor (rhodopsin family) 293206 7 transmembrane receptor (rhodopsin family) 293207 7 transmembrane receptor (rhodopsin family) 293209 7 transmembrane receptor (rhodopsin family) 293210 7 transmembrane receptor (rhodopsin family) 293211 HMG (high mobility group) box 293211 7 transmembrane receptor (rhodopsin family) 293212 7 transmembrane receptor (rhodopsin family) 293214 7 transmembrane receptor (rhodopsin family) 293215 7 transmembrane receptor (rhodopsin family) 293216 7 transmembrane receptor (rhodopsin family) 293218 7 transmembrane receptor (rhodopsin family) 293220 7 transmembrane receptor (rhodopsin family) 293221 7 transmembrane receptor (rhodopsin family) 293222 7 transmembrane receptor (rhodopsin family) 293223 7 transmembrane receptor (rhodopsin family) 293224 7 transmembrane receptor (rhodopsin family) 293225 7 transmembrane receptor (rhodopsin family) 293226 7 transmembrane receptor (rhodopsin family) 293227 7 transmembrane receptor (rhodopsin family) 293228 7 transmembrane receptor (rhodopsin family) 293229 7 transmembrane receptor (rhodopsin family) 293230 7 transmembrane receptor (rhodopsin family) 293231 7 transmembrane receptor (rhodopsin family) 293232 7 transmembrane receptor (rhodopsin family) 293233 7 transmembrane receptor (rhodopsin family) 293234 7 transmembrane receptor (rhodopsin family) 293235 7 transmembrane receptor (rhodopsin family) 293236 7 transmembrane receptor (rhodopsin family) 293237 7 transmembrane receptor (rhodopsin family) 293238 7 transmembrane receptor (rhodopsin family) 293239 7 transmembrane receptor (rhodopsin family) 293240 7 transmembrane receptor (rhodopsin family) 293242 7 transmembrane receptor (rhodopsin family) 293243 7 transmembrane receptor (rhodopsin family) 293244 7 transmembrane receptor (rhodopsin family) 293245 7 transmembrane receptor (rhodopsin family) 293246 7 transmembrane receptor (rhodopsin family) 293247 7 transmembrane receptor (rhodopsin family) 293248 7 transmembrane receptor (rhodopsin family) 293249 7 transmembrane receptor (rhodopsin family) 293250 7 transmembrane receptor (rhodopsin family) 293251 7 transmembrane receptor (rhodopsin family) 293252 7 transmembrane receptor (rhodopsin family) 293253 7 transmembrane receptor (rhodopsin family) 293254 7 transmembrane receptor (rhodopsin family) 293255 7 transmembrane receptor (rhodopsin family) 293256 7 transmembrane receptor (rhodopsin family) 293257 7 transmembrane receptor (rhodopsin family) 293258 7 transmembrane receptor (rhodopsin family) 293259 7 transmembrane receptor (rhodopsin family) 293260 7 transmembrane receptor (rhodopsin family) 293261 7 transmembrane receptor (rhodopsin family) 293262 7 transmembrane receptor (rhodopsin family) 293263 7 transmembrane receptor (rhodopsin family) 293264 Globin 293265 Globin 293266 Globin 293267 Globin 293268 7 transmembrane receptor (rhodopsin family) 293269 7 transmembrane receptor (rhodopsin family) 293270 7 transmembrane receptor (rhodopsin family) 293271 7 transmembrane receptor (rhodopsin family) 293272 7 transmembrane receptor (rhodopsin family) 293273 7 transmembrane receptor (rhodopsin family) 293274 7 transmembrane receptor (rhodopsin family) 293275 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 293276 7 transmembrane receptor (rhodopsin family) 293278 7 transmembrane receptor (rhodopsin family) 293279 7 transmembrane receptor (rhodopsin family) 293280 7 transmembrane receptor (rhodopsin family) 293281 7 transmembrane receptor (rhodopsin family) 293282 7 transmembrane receptor (rhodopsin family) 293283 7 transmembrane receptor (rhodopsin family) 293285 7 transmembrane receptor (rhodopsin family) 293286 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 293286 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 293287 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 293288 Olfactomedin-like domain 293289 7 transmembrane receptor (rhodopsin family) 293290 7 transmembrane receptor (rhodopsin family) 293291 Ribosomal protein L36e 293292 7 transmembrane receptor (rhodopsin family) 293294 B-box zinc finger 293294 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 293295 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 293296 B-box zinc finger 293296 Zinc finger, C3HC4 type (RING finger) 293298 B-box zinc finger 293298 Zinc finger, C3HC4 type (RING finger) 293298 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 293299 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 293300 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 293301 7 transmembrane receptor (rhodopsin family) 293302 7 transmembrane receptor (rhodopsin family) 293303 7 transmembrane receptor (rhodopsin family) 293304 7 transmembrane receptor (rhodopsin family) 293305 7 transmembrane receptor (rhodopsin family) 293306 7 transmembrane receptor (rhodopsin family) 293307 7 transmembrane receptor (rhodopsin family) 293308 7 transmembrane receptor (rhodopsin family) 293309 7 transmembrane receptor (rhodopsin family) 293310 Ubiquitin carboxyl-terminal hydrolase family 2 293310 Ubiquitin carboxyl-terminal hydrolases family 2 293311 7 transmembrane receptor (rhodopsin family) 293312 7 transmembrane receptor (rhodopsin family) 293314 7 transmembrane receptor (rhodopsin family) 293315 7 transmembrane receptor (rhodopsin family) 293316 7 transmembrane receptor (rhodopsin family) 293317 Ubiquitin carboxyl-terminal hydrolase family 2 293317 Ubiquitin carboxyl-terminal hydrolases family 2 293318 7 transmembrane receptor (rhodopsin family) 293319 7 transmembrane receptor (rhodopsin family) 293320 7 transmembrane receptor (rhodopsin family) 293321 7 transmembrane receptor (rhodopsin family) 293322 7 transmembrane receptor (rhodopsin family) 293323 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 293324 7 transmembrane receptor (rhodopsin family) 293325 7 transmembrane receptor (rhodopsin family) 293326 7 transmembrane receptor (rhodopsin family) 293327 7 transmembrane receptor (rhodopsin family) 293328 7 transmembrane receptor (rhodopsin family) 293329 7 transmembrane receptor (rhodopsin family) 293330 7 transmembrane receptor (rhodopsin family) 293331 7 transmembrane receptor (rhodopsin family) 293332 Ribosomal L10 293333 7 transmembrane receptor (rhodopsin family) 293334 7 transmembrane receptor (rhodopsin family) 293335 7 transmembrane receptor (rhodopsin family) 293336 7 transmembrane receptor (rhodopsin family) 293337 7 transmembrane receptor (rhodopsin family) 293338 7 transmembrane receptor (rhodopsin family) 293339 7 transmembrane receptor (rhodopsin family) 293340 7 transmembrane receptor (rhodopsin family) 293341 7 transmembrane receptor (rhodopsin family) 293342 7 transmembrane receptor (rhodopsin family) 293345 Transcription initiation factor TFIID 23-30kDa subunit 293349 Ribosomal protein L24e 293350 7 transmembrane receptor (rhodopsin family) 293351 7 transmembrane receptor (rhodopsin family) 293352 7 transmembrane receptor (rhodopsin family) 293353 7 transmembrane receptor (rhodopsin family) 293354 7 transmembrane receptor (rhodopsin family) 293355 7 transmembrane receptor (rhodopsin family) 293356 7 transmembrane receptor (rhodopsin family) 293357 7 transmembrane receptor (rhodopsin family) 293358 7 transmembrane receptor (rhodopsin family) 293359 7 transmembrane receptor (rhodopsin family) 293361 7 transmembrane receptor (rhodopsin family) 293362 7 transmembrane receptor (rhodopsin family) 293363 7 transmembrane receptor (rhodopsin family) 293364 7 transmembrane receptor (rhodopsin family) 293365 7 transmembrane receptor (rhodopsin family) 293366 7 transmembrane receptor (rhodopsin family) 293367 7 transmembrane receptor (rhodopsin family) 293368 7 transmembrane receptor (rhodopsin family) 293369 7 transmembrane receptor (rhodopsin family) 293370 7 transmembrane receptor (rhodopsin family) 293371 7 transmembrane receptor (rhodopsin family) 293372 Endomembrane protein 70 293373 7 transmembrane receptor (rhodopsin family) 293374 7 transmembrane receptor (rhodopsin family) 293375 7 transmembrane receptor (rhodopsin family) 293376 7 transmembrane receptor (rhodopsin family) 293377 7 transmembrane receptor (rhodopsin family) 293378 PX domain. PX domains bind to phosphoinositides 293379 7 transmembrane receptor (rhodopsin family) 293380 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 293381 7 transmembrane receptor (rhodopsin family) 293382 7 transmembrane receptor (rhodopsin family) 293383 7 transmembrane receptor (rhodopsin family) 293384 7 transmembrane receptor (rhodopsin family) 293385 7 transmembrane receptor (rhodopsin family) 293386 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293386 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293387 7 transmembrane receptor (rhodopsin family) 293388 7 transmembrane receptor (rhodopsin family) 293389 7 transmembrane receptor (rhodopsin family) 293390 7 transmembrane receptor (rhodopsin family) 293391 7 transmembrane receptor (rhodopsin family) 293392 7 transmembrane receptor (rhodopsin family) 293394 7 transmembrane receptor (rhodopsin family) 293395 7 transmembrane receptor (rhodopsin family) 293396 7 transmembrane receptor (rhodopsin family) 293398 7 transmembrane receptor (rhodopsin family) 293399 7 transmembrane receptor (rhodopsin family) 293400 7 transmembrane receptor (rhodopsin family) 293401 7 transmembrane receptor (rhodopsin family) 293402 7 transmembrane receptor (rhodopsin family) 293403 7 transmembrane receptor (rhodopsin family) 293404 7 transmembrane receptor (rhodopsin family) 293408 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 293409 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 293409 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 293410 PH domain. PH stands for pleckstrin homology 293411 Importin-beta N-terminal domain 293412 Ribosomal protein L36e 293417 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293417 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293418 Ribosomal protein L15 293420 HMG (high mobility group) box 293421 7 transmembrane receptor (rhodopsin family) 293422 7 transmembrane receptor (rhodopsin family) 293423 7 transmembrane receptor (rhodopsin family) 293424 7 transmembrane receptor (rhodopsin family) 293425 7 transmembrane receptor (rhodopsin family) 293426 NB-ARC domain 293426 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 293427 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 293428 7 transmembrane receptor (rhodopsin family) 293429 7 transmembrane receptor (rhodopsin family) 293430 7 transmembrane receptor (rhodopsin family) 293431 7 transmembrane receptor (rhodopsin family) 293432 7 transmembrane receptor (rhodopsin family) 293435 7 transmembrane receptor (rhodopsin family) 293436 7 transmembrane receptor (rhodopsin family) 293437 7 transmembrane receptor (rhodopsin family) 293438 7 transmembrane receptor (rhodopsin family) 293439 7 transmembrane receptor (rhodopsin family) 293440 7 transmembrane receptor (rhodopsin family) 293441 7 transmembrane receptor (rhodopsin family) 293442 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 293443 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 293445 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 293446 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 293448 Insulinase (Peptidase family M16) 293451 Sulfotransferase protein 293451 Sulfotransferase protein 293453 Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix bu 293455 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 293458 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293458 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293461 Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence 293462 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 293462 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 293463 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293463 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293464 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 293464 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 293465 Sulfotransferase protein 293466 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 293467 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 293468 Sulfotransferase protein 293470 Ribosomal protein S8 293471 Hsp90 protein 293472 Ribosomal protein L13e 293480 Ribosomal protein L6 293481 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 293484 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 293485 CLN3 protein. This is a family of proteins from the CLN3 gene. A missense mutation of glutamic acid (E) to lysine (K) at position 295 in the human protein has been implicated in Juvenile neuronal ceroid lipofuscinosis (Batten disease) 293490 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 293491 Yippee putative zinc-binding protein 293499 CUB domain 293504 Quinolinate phosphoribosyl transferase, C-terminal domain. Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase EC:2.4.2.19 is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses 293505 GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein 293507 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 293508 Protein kinase domain 293509 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 293510 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 293510 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 293511 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 293515 Ciliary neurotrophic factor 293516 Trypsin 293519 von Willebrand factor type A domain 293519 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 293520 von Willebrand factor type A domain 293523 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 293525 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 293533 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 293534 CUB domain 293534 Zona pellucida-like domain 293534 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 293537 Homeobox domain 293538 Homeobox domain 293544 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 293545 7 transmembrane receptor (rhodopsin family) 293546 7 transmembrane receptor (metabotropic glutamate family) 293553 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293553 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293560 HMG (high mobility group) box 293565 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293565 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293566 Zinc carboxypeptidase 293567 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 293568 Homeobox domain 293572 HMG (high mobility group) box 293573 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 293573 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 293575 Cyclophilin type peptidyl-prolyl cis-trans isomerase 293577 Ribosomal protein L31e 293578 Fimbrial Usher protein. This protein is involved in biogenesis of gram negative bacterial pili 293578 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 293582 Ribosomal protein S5, C-terminal domain 293582 Ribosomal protein S5, N-terminal domain 293584 Ribosomal protein L36e 293588 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 293590 7 transmembrane receptor (rhodopsin family) 293592 7 transmembrane receptor (rhodopsin family) 293593 7 transmembrane receptor (rhodopsin family) 293594 7 transmembrane receptor (rhodopsin family) 293595 7 transmembrane receptor (rhodopsin family) 293596 7 transmembrane receptor (rhodopsin family) 293597 7 transmembrane receptor (rhodopsin family) 293598 7 transmembrane receptor (rhodopsin family) 293599 7 transmembrane receptor (rhodopsin family) 293600 7 transmembrane receptor (rhodopsin family) 293601 7 transmembrane receptor (rhodopsin family) 293602 7 transmembrane receptor (rhodopsin family) 293603 7 transmembrane receptor (rhodopsin family) 293604 7 transmembrane receptor (rhodopsin family) 293605 7 transmembrane receptor (rhodopsin family) 293606 7 transmembrane receptor (rhodopsin family) 293607 7 transmembrane receptor (rhodopsin family) 293608 7 transmembrane receptor (rhodopsin family) 293609 7 transmembrane receptor (rhodopsin family) 293612 Ribosomal protein L21e 293613 Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders 293615 Sir2 family 293616 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 293620 Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phosp 293621 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 293621 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 293625 Death domain 293626 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 293627 Tetraspanin family 293628 Glycosyl hydrolases family 18 293631 Protein kinase domain 293639 7 transmembrane receptor (rhodopsin family) 293640 NAD synthase. NAD synthase (EC:6.3.5.1) is involved in the de novo synthesis of NAD and is induced by stress factors such as heat shock and glucose limitation 293641 Actin 293642 Ribosomal protein L21e 293643 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 293646 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 293648 7 transmembrane receptor (rhodopsin family) 293650 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 293652 4Fe-4S binding domain. Superfamily includes proteins containing domains which bind to iron-sulfur clusters. Members include bacterial ferredoxins, various dehydrogenases, and various reductases. Structure of the domain is an alpha-antiparallel beta sandwi 293654 C2 domain 293655 Respiratory-chain NADH dehydrogenase 51 Kd subunit 293656 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 293656 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 293657 DNA polymerase delta, subunit 4 293660 ADP-ribosylation factor family 293660 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 293661 PH domain. PH stands for pleckstrin homology 293662 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 293663 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 293664 Ribosomal L29e protein family 293665 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 293666 Ribosomal protein L11, RNA binding domain 293666 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 293670 ADP-ribosylation factor family 293670 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 293670 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 293672 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 293673 Eukaryotic initiation factor 1A 293676 Papain family cysteine protease 293676 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 293678 ERCC4 domain. This domain is predicted to be a nuclease domain 293686 Sugar (and other) transporter 293688 Ergosterol biosynthesis ERG4/ERG24 family 293690 SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 p 293691 MIT domain 293691 PX domain. PX domains bind to phosphoinositides 293693 Protein kinase domain 293693 Intermediate filament protein 293693 Protein kinase C terminal domain 293693 PH domain. PH stands for pleckstrin homology 293693 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 293693 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 293693 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 293694 Protein kinase domain 293694 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 293696 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 293697 Ribosomal L29e protein family 293699 ATP synthase subunit C 293700 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 293701 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 293702 FKBP-type peptidyl-prolyl cis-trans isomerase 293704 Domain of unknown function DUF60. The proteins in this family have no known function. Two proteins have been annotated as phosphotransferases, however this is not supported experimentally, and should be viewed with caution 293710 Galactoside-binding lectin 293715 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 293719 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 293721 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 293725 Elongation factor 1 gamma, conserved domain 293725 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 293729 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 293730 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 293733 Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregatio 293735 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 293737 Core histone H2A/H2B/H3/H4 293739 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 293740 PX domain. PX domains bind to phosphoinositides 293741 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 293742 Ribosomal protein L6e 293743 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, 293743 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 293744 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 293746 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 293747 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 293747 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 293748 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 293749 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 293751 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 293752 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 293754 Ribosomal protein L16 293757 Oxysterol-binding protein 293758 7 transmembrane receptor (rhodopsin family) 293759 7 transmembrane receptor (rhodopsin family) 293760 7 transmembrane receptor (rhodopsin family) 293761 7 transmembrane receptor (rhodopsin family) 293762 7 transmembrane receptor (rhodopsin family) 293763 7 transmembrane receptor (rhodopsin family) 293764 7 transmembrane receptor (rhodopsin family) 293765 7 transmembrane receptor (rhodopsin family) 293767 7 transmembrane receptor (rhodopsin family) 293768 7 transmembrane receptor (rhodopsin family) 293769 7 transmembrane receptor (rhodopsin family) 293770 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 293771 7 transmembrane receptor (rhodopsin family) 293773 7 transmembrane receptor (rhodopsin family) 293776 Fibrillarin 293778 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293778 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293780 7 transmembrane receptor (rhodopsin family) 293781 7 transmembrane receptor (rhodopsin family) 293783 LIM domain. This family represents two copies of the LIM structural domain 293784 7 transmembrane receptor (rhodopsin family) 293785 7 transmembrane receptor (rhodopsin family) 293786 7 transmembrane receptor (rhodopsin family) 293787 7 transmembrane receptor (rhodopsin family) 293788 7 transmembrane receptor (rhodopsin family) 293789 7 transmembrane receptor (rhodopsin family) 293790 7 transmembrane receptor (rhodopsin family) 293791 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 293792 7 transmembrane receptor (rhodopsin family) 293793 7 transmembrane receptor (rhodopsin family) 293794 7 transmembrane receptor (rhodopsin family) 293795 7 transmembrane receptor (rhodopsin family) 293796 7 transmembrane receptor (rhodopsin family) 293797 7 transmembrane receptor (rhodopsin family) 293798 BTG1 family. A novel family of anti-proliferative proteins 293799 7 transmembrane receptor (rhodopsin family) 293800 7 transmembrane receptor (rhodopsin family) 293801 7 transmembrane receptor (rhodopsin family) 293802 7 transmembrane receptor (rhodopsin family) 293803 7 transmembrane receptor (rhodopsin family) 293804 7 transmembrane receptor (rhodopsin family) 293805 7 transmembrane receptor (rhodopsin family) 293806 7 transmembrane receptor (rhodopsin family) 293808 7 transmembrane receptor (rhodopsin family) 293809 7 transmembrane receptor (rhodopsin family) 293810 7 transmembrane receptor (rhodopsin family) 293811 7 transmembrane receptor (rhodopsin family) 293812 7 transmembrane receptor (rhodopsin family) 293813 7 transmembrane receptor (rhodopsin family) 293814 7 transmembrane receptor (rhodopsin family) 293815 7 transmembrane receptor (rhodopsin family) 293816 7 transmembrane receptor (rhodopsin family) 293817 7 transmembrane receptor (rhodopsin family) 293818 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293820 Aminotransferase class-V 293827 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi 293829 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 293831 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 293832 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 293833 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 293834 Intermediate filament protein 293835 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293836 HMG (high mobility group) box 293837 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 293839 LIM domain. This family represents two copies of the LIM structural domain 293842 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 293843 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 293843 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 293846 Ribosomal L28e protein family 293847 Protein-tyrosine phosphatase 293847 Putative tyrosine phosphatase family. This family consists of putative tyrosine phosphatase proteins, this function is inferred from several sequences at the top of the noise, EC:3.1.3.48 293847 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 293851 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 293855 Isocitrate/isopropylmalate dehydrogenase 293856 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 293858 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 293861 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 293864 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 293866 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 293867 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 293868 Ribosomal L29e protein family 293870 Ribosomal protein S19 293874 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 293875 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b 293876 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293876 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293878 7 transmembrane receptor (rhodopsin family) 293880 Cyclophilin type peptidyl-prolyl cis-trans isomerase 293883 Phosphatidylethanolamine-binding protein 293884 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 293886 Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90 293887 Dynein light chain type 1 293894 DNA gyrase/topoisomerase IV, subunit A 293894 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a 293896 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 293899 Eukaryotic ribosomal protein L18 293903 DNA repair protein Rad4 293905 Ribosomal L15 293906 Alkaline phosphatase 293907 Ribosomal L10 293908 Occludin/ELL family 293909 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 293911 Eukaryotic porin 293913 HSF-type DNA-binding 293915 Mitochondrial carrier protein 293920 Ribosomal protein L13e 293921 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293921 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293924 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 293924 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 293924 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 293927 Sulfatase 293928 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 293929 Ribosomal protein S2 293931 Ribosomal protein L21e 293932 Ribosomal L29e protein family 293935 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 293937 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 293939 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 293940 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 293941 Ribosomal protein L10 293942 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 293945 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 293946 Thrombospondin N-terminal -like domain 293947 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 293950 Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are 293952 Josephin 293954 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 293955 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m 293957 Mitochondrial carrier protein 293958 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 293960 A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation 293960 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets in 293962 Ribosomal protein L21e 293964 Ribosomal protein S17 293967 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 293970 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 293971 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 293974 Phosphoglucomutase/phosphomannomutase, C-terminal domain 293974 pfam02880, PGM_PMM_III, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain III 293975 pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I 293975 pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II 293975 pfam02880, PGM_PMM_III, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain III 293976 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 293976 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 293978 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293979 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293980 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293981 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293982 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293984 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293985 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293987 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293988 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293989 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 293990 wnt family 293994 Ribosomal protein L21e 293996 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 293997 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 293998 Homeobox domain 294000 Homeobox domain 294001 Homeobox domain 294006 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 294007 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 294011 Tricarboxylate carrier 294014 Domain of unknown function DUF21. This transmembrane region has no known function. Many of the sequences in this family are annotated as hemolysins, however this is due to a similarity to a protein which does not contain this domain. This domain is found 294017 Ribosomal protein S5, C-terminal domain 294018 WD domain, G-beta repeat 294021 Translation initiation factor SUI1 294027 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 294029 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 294035 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 294035 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 294035 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 294036 Ribosomal protein L21e 294046 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 294047 Ribosomal protein L36e 294051 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 294051 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 294052 Ribosomal protein L21e 294053 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 294053 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 294056 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294056 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294057 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294057 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294058 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 294059 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 294059 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 294061 Proteasome A-type and B-type 294063 Ribosomal S17 294065 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 294065 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 294068 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 294069 Ribosomal protein S8e 294070 Ribosomal protein L15 294072 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 294075 7 transmembrane receptor (rhodopsin family) 294076 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 294077 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 294081 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 294081 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 294085 metallopeptidase family M24 294086 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 294087 Proteasome A-type and B-type 294088 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoform 294092 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 294093 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 294094 ab-hydrolase associated lipase region 294094 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 294096 ab-hydrolase associated lipase region 294096 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 294097 ab-hydrolase associated lipase region 294097 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 294101 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 294102 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 294103 ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PA 294106 Actin 294109 7 transmembrane receptor (rhodopsin family) 294111 Ribosomal protein S21e 294114 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 294115 HMG (high mobility group) box 294116 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 294117 Ribosomal protein L19e 294119 DOMON domain. The DOMON (named after dopamine beta-monooxygenase N-terminal) domain is 110-125 residues long. It is predicted to form an all beta fold with 7-8 strands. The domain may mediate extracellular adhesive interactions 294121 7 transmembrane receptor (rhodopsin family) 294122 7 transmembrane receptor (rhodopsin family) 294122 7 transmembrane receptor (rhodopsin family) 294123 7 transmembrane receptor (rhodopsin family) 294124 7 transmembrane receptor (rhodopsin family) 294124 7 transmembrane receptor (rhodopsin family) 294125 7 transmembrane receptor (rhodopsin family) 294125 7 transmembrane receptor (rhodopsin family) 294126 7 transmembrane receptor (rhodopsin family) 294126 7 transmembrane receptor (rhodopsin family) 294127 7 transmembrane receptor (rhodopsin family) 294128 7 transmembrane receptor (rhodopsin family) 294129 7 transmembrane receptor (rhodopsin family) 294130 7 transmembrane receptor (rhodopsin family) 294130 7 transmembrane receptor (rhodopsin family) 294131 7 transmembrane receptor (rhodopsin family) 294131 7 transmembrane receptor (rhodopsin family) 294132 7 transmembrane receptor (rhodopsin family) 294132 7 transmembrane receptor (rhodopsin family) 294133 7 transmembrane receptor (rhodopsin family) 294133 7 transmembrane receptor (rhodopsin family) 294134 7 transmembrane receptor (rhodopsin family) 294134 7 transmembrane receptor (rhodopsin family) 294136 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 294137 7 transmembrane receptor (rhodopsin family) 294138 7 transmembrane receptor (rhodopsin family) 294139 7 transmembrane receptor (rhodopsin family) 294140 7 transmembrane receptor (rhodopsin family) 294143 7 transmembrane receptor (rhodopsin family) 294143 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 294144 7 transmembrane receptor (rhodopsin family) 294145 7 transmembrane receptor (rhodopsin family) 294146 7 transmembrane receptor (rhodopsin family) 294147 7 transmembrane receptor (rhodopsin family) 294148 7 transmembrane receptor (rhodopsin family) 294149 7 transmembrane receptor (rhodopsin family) 294150 7 transmembrane receptor (rhodopsin family) 294151 7 transmembrane receptor (rhodopsin family) 294152 7 transmembrane receptor (rhodopsin family) 294153 7 transmembrane receptor (rhodopsin family) 294153 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 294155 7 transmembrane receptor (rhodopsin family) 294157 7 transmembrane receptor (rhodopsin family) 294158 7 transmembrane receptor (rhodopsin family) 294159 7 transmembrane receptor (rhodopsin family) 294160 7 transmembrane receptor (rhodopsin family) 294161 7 transmembrane receptor (rhodopsin family) 294162 7 transmembrane receptor (rhodopsin family) 294163 7 transmembrane receptor (rhodopsin family) 294164 7 transmembrane receptor (rhodopsin family) 294165 Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold 294165 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 294165 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 294166 7 transmembrane receptor (rhodopsin family) 294167 7 transmembrane receptor (rhodopsin family) 294168 7 transmembrane receptor (rhodopsin family) 294173 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 294174 Class I Histocompatibility antigen, domains alpha 1 and 2 294175 7 transmembrane receptor (rhodopsin family) 294176 7 transmembrane receptor (rhodopsin family) 294177 7 transmembrane receptor (rhodopsin family) 294179 7 transmembrane receptor (rhodopsin family) 294181 7 transmembrane receptor (rhodopsin family) 294183 7 transmembrane receptor (rhodopsin family) 294184 7 transmembrane receptor (rhodopsin family) 294185 7 transmembrane receptor (rhodopsin family) 294186 7 transmembrane receptor (rhodopsin family) 294187 7 transmembrane receptor (rhodopsin family) 294188 7 transmembrane receptor (rhodopsin family) 294189 7 transmembrane receptor (rhodopsin family) 294190 7 transmembrane receptor (rhodopsin family) 294191 7 transmembrane receptor (rhodopsin family) 294192 7 transmembrane receptor (rhodopsin family) 294193 7 transmembrane receptor (rhodopsin family) 294194 7 transmembrane receptor (rhodopsin family) 294195 Class I Histocompatibility antigen, domains alpha 1 and 2 294196 7 transmembrane receptor (rhodopsin family) 294197 7 transmembrane receptor (rhodopsin family) 294198 7 transmembrane receptor (rhodopsin family) 294199 7 transmembrane receptor (rhodopsin family) 294200 7 transmembrane receptor (rhodopsin family) 294201 7 transmembrane receptor (rhodopsin family) 294202 7 transmembrane receptor (rhodopsin family) 294203 7 transmembrane receptor (rhodopsin family) 294204 7 transmembrane receptor (rhodopsin family) 294205 Ribosomal protein S19 294208 B-box zinc finger 294208 Zinc finger, C3HC4 type (RING finger) 294208 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 294209 Zinc finger, C3HC4 type (RING finger) 294210 B-box zinc finger 294210 Zinc finger, C3HC4 type (RING finger) 294210 Phosphatidylethanolamine-binding protein 294210 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 294211 Class I Histocompatibility antigen, domains alpha 1 and 2 294212 Class I Histocompatibility antigen, domains alpha 1 and 2 294213 Class I Histocompatibility antigen, domains alpha 1 and 2 294213 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 294214 Domain of unknown function (DUF380). Domain of unknown function, present in a ribonuclease P subunit in humans. Possibly a metal-binding domain 294215 Class I Histocompatibility antigen, domains alpha 1 and 2 294216 Class I Histocompatibility antigen, domains alpha 1 and 2 294217 Class I Histocompatibility antigen, domains alpha 1 and 2 294218 Class I Histocompatibility antigen, domains alpha 1 and 2 294219 Class I Histocompatibility antigen, domains alpha 1 and 2 294220 Class I Histocompatibility antigen, domains alpha 1 and 2 294221 Class I Histocompatibility antigen, domains alpha 1 and 2 294222 Class I Histocompatibility antigen, domains alpha 1 and 2 294223 Class I Histocompatibility antigen, domains alpha 1 and 2 294224 Ribosomal protein S5, C-terminal domain 294224 Ribosomal protein S5, N-terminal domain 294225 Class I Histocompatibility antigen, domains alpha 1 and 2 294227 Class I Histocompatibility antigen, domains alpha 1 and 2 294228 Class I Histocompatibility antigen, domains alpha 1 and 2 294234 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 294234 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 294236 Transcription factor Tfb2 294237 Class I Histocompatibility antigen, domains alpha 1 and 2 294238 Class I Histocompatibility antigen, domains alpha 1 and 2 294239 Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzymes 294247 Casein kinase II regulatory subunit 294248 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 294252 MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p 294254 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 294254 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 294256 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 294257 Trypsin 294258 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 294262 Fibrinogen beta and gamma chains, C-terminal globular domain 294266 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 294269 Class II histocompatibility antigen, alpha domain 294269 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 294270 Class II histocompatibility antigen, beta domain 294270 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 294271 Class II histocompatibility antigen, beta domain 294271 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 294272 SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins 294272 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 294272 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 294273 Class II histocompatibility antigen, beta domain 294274 Class II histocompatibility antigen, alpha domain 294278 Class II histocompatibility antigen, alpha domain 294278 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 294279 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 294282 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 294283 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 294286 Kinesin motor domain 294288 CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide vari 294289 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 294292 NUDIX domain 294293 Ets-domain 294293 Sterile alpha motif (SAM)/Pointed domain 294297 CUB domain 294299 TEA/ATTS domain family 294304 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 294307 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 294309 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 294312 SCP-like extracellular protein. This domain is also found in prokaryotes 294313 Mitochondrial carrier protein 294314 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 294316 NAC domain 294320 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 294326 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 294328 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 294328 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 294328 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 294331 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 294334 S-adenosyl-L-homocysteine hydrolase 294335 von Willebrand factor type D domain 294336 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 294337 von Willebrand factor type A domain 294338 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 294341 Sodium:solute symporter family 294342 Sodium:solute symporter family 294344 Ribosomal protein L13e 294347 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 294351 Ribosomal protein L21e 294352 Cadherin domain 294356 Actin 294358 Proteasome A-type and B-type 294359 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 294359 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 294361 Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones 294367 Ribosomal protein S5, N-terminal domain 294368 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 294370 Enolase, N-terminal domain 294370 Enolase, C-terminal TIM barrel domain 294376 Dynein light chain type 1 294377 Ribosomal protein S2 294379 Vinculin family 294380 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 294382 Vinculin family 294386 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps 294391 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 294391 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 294392 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 294393 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294393 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294394 7 transmembrane receptor (metabotropic glutamate family) 294394 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 294395 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 294396 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 294399 Ribosomal protein L5 294399 ribosomal L5P family C-terminus. This region is found associated with pfam00281 294400 Putative undecaprenyl diphosphate synthase. Previously known as uncharacterized protein family UPF0015, a single member of this family has been identified as an undecaprenyl diphosphate synthase 294401 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 294405 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294405 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294406 MCM2/3/5 family 294413 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 294415 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 294415 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 294416 Ribosomal L15 294417 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294417 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294418 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294418 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294420 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 294421 TMS membrane protein/tumour differentially expressed protein (TDE) 294421 TMS membrane protein/tumour differentially expressed protein (TDE) 294423 Homeobox domain 294424 Homeobox domain 294425 Homeobox domain 294426 Homeobox domain 294427 Homeobox domain 294428 Homeobox domain 294429 RanBP1 domain 294429 Cyclophilin type peptidyl-prolyl cis-trans isomerase 294434 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 294435 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 294436 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 294439 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 294440 Thrombospondin N-terminal -like domain 294444 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 294447 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294447 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294448 Ribosomal protein L13 294449 Sulfotransferase protein 294451 Cyclophilin type peptidyl-prolyl cis-trans isomerase 294453 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 294458 Homeobox domain 294459 Homeobox domain 294460 Homeobox domain 294461 Homeobox domain 294462 Homeobox domain 294463 Homeobox domain 294464 Homeobox domain 294465 Homeobox domain 294466 Homeobox domain 294467 Homeobox domain 294468 Homeobox domain 294469 Homeobox domain 294470 Homeobox domain 294471 Homeobox domain 294472 Homeobox domain 294473 Homeobox domain 294474 Homeobox domain 294475 Homeobox domain 294476 Homeobox domain 294477 Homeobox domain 294478 Homeobox domain 294479 Homeobox domain 294480 Homeobox domain 294481 Homeobox domain 294482 Homeobox domain 294483 Homeobox domain 294484 Homeobox domain 294485 Homeobox domain 294486 Homeobox domain 294487 Homeobox domain 294488 Homeobox domain 294489 Homeobox domain 294490 Homeobox domain 294491 Homeobox domain 294491 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 294492 Homeobox domain 294493 Homeobox domain 294494 Homeobox domain 294495 Homeobox domain 294496 Homeobox domain 294497 Homeobox domain 294498 Homeobox domain 294503 Transforming growth factor beta like domain 294504 Inorganic pyrophosphatase 294506 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 294509 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294509 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294510 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 294515 Fork head domain 294524 Translation initiation factor SUI1 294528 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 294529 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 294530 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 294532 'Cold-shock' DNA-binding domain 294533 'Cold-shock' DNA-binding domain 294534 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 294535 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 294540 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 294543 Ribosomal protein S6e 294545 Protein kinase domain 294547 Ribosomal protein L21e 294549 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 294554 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 294555 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 294559 Zona pellucida-like domain 294562 Homeobox domain 294562 Pou domain - N-terminal to homeobox domain 294564 TEA/ATTS domain family 294566 Skp1 family, tetramerisation domain 294567 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 294568 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 294569 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 294570 Homeobox domain 294571 Homeobox domain 294572 Homeobox domain 294573 Homeobox domain 294574 Homeobox domain 294575 Homeobox domain 294576 Homeobox domain 294577 Homeobox domain 294578 Homeobox domain 294579 Homeobox domain 294580 Homeobox domain 294581 Homeobox domain 294582 Homeobox domain 294583 Homeobox domain 294584 Homeobox domain 294585 Homeobox domain 294586 Homeobox domain 294587 Homeobox domain 294588 Homeobox domain 294589 Homeobox domain 294590 Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vps 294592 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 294593 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 294595 TPR Domain 294598 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294600 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 294601 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 294602 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 294603 Stromal antigen (SA/STAG) protein 294606 Homeobox domain 294606 Pou domain - N-terminal to homeobox domain 294607 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 294608 TT viral orf 1. TT virus (TTV), isolated initially from a Japanese patient with hepatitis of unknown aetiology, has since been found to infect both healthy and diseased individuals and numerous prevalence studies have raised questions about its role in un 294609 Ribosomal protein L6e 294617 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 294617 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 294618 Uracil DNA glycosylase superfamily 294620 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 294621 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 294623 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294625 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294625 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294629 Eukaryotic porin 294635 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 294637 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 294638 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 294639 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 294640 Homeobox domain 294642 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 294644 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 294647 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 294648 Ribosomal protein L31e 294649 MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p 294650 DNA mismatch repair proteins, mutS family 294650 MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p 294655 Ribosomal protein S6e 294657 Sushi domain (SCR repeat) 294657 Lectin C-type domain. This family includes both long and short form C-type 294657 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 294658 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294658 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294660 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 294661 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 294662 ATP synthase subunit C 294663 7 transmembrane receptor (rhodopsin family) 294664 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 294668 Ribosomal protein S26e 294668 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 294669 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 294672 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 294672 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 294673 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold 294674 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 294676 Ribosomal protein S15 294679 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 294680 NAC domain 294689 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 294690 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 294691 Ribosomal protein L21e 294691 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 294692 ENV polyprotein (coat polyprotein) 294693 Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex 294698 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 294700 Ribosomal protein L21e 294702 Ribosomal protein L35Ae 294704 Nucleoside diphosphate kinase 294705 ICE-like protease (caspase) p20 domain 294706 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 294707 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 294708 TPR Domain 294711 Cyclophilin type peptidyl-prolyl cis-trans isomerase 294713 Ribosomal protein L21e 294716 pfam02872, 5_nucleotidaseC, 5'-nucleotidase, C-terminal domain 294717 Zinc finger, C3HC4 type (RING finger) 294718 Ribosomal RNA adenine dimethylase 294721 Ribosomal L29e protein family 294722 Ribosomal L29e protein family 294725 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294727 Mitochondrial carrier protein 294728 Mitochondrial carrier protein 294730 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 294731 Phosphoglycerate kinase 294732 Actin 294737 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 294739 Fibronectin type III domain 294741 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 294744 Glutathione peroxidase 294746 Trypsin 294748 Translation initiation factor SUI1 294753 MoaE protein. This family contains the MoaE protein that is involved in biosynthesis of molybdopterin. Molybdopterin, the universal component of the pterin molybdenum cofactors, contains a dithiolene group serving to bind Mo. Addition of the dithiolene su 294754 eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 294754 eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 294764 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294764 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294771 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 294773 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 294776 Ets-domain 294777 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 294780 Fibronectin type III domain 294781 Ribosomal protein L21e 294788 Ribosomal protein S5, C-terminal domain 294788 Ribosomal protein S5, N-terminal domain 294790 F-box domain 294792 UDP-glucoronosyl and UDP-glucosyl transferase 294793 UDP-glucoronosyl and UDP-glucosyl transferase 294794 UDP-glucoronosyl and UDP-glucosyl transferase 294795 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 294798 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 294799 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 294800 Repair protein Rad1/Rec1/Rad17 294802 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294805 Profilin 294806 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 294807 7 transmembrane receptor (rhodopsin family) 294809 Ribosomal protein L23 294809 Thrombospondin type 1 domain 294809 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 294810 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 294811 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294814 RNase3 domain 294814 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 294815 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 294816 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 294817 Ribosomal protein S26e 294818 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294821 Peptidase family M49 294823 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 294826 ENV polyprotein (coat polyprotein) 294829 HMG (high mobility group) box 294831 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 294832 Ribosomal L29 protein 294835 Hsp90 protein 294840 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294840 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294842 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 294844 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294848 Hsp90 protein 294849 Hsp90 protein 294852 Ribosomal protein L19e 294853 Intermediate filament protein 294854 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 294857 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 294858 Aconitase family (aconitate hydratase) 294861 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 294864 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 294869 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 294871 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 294872 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 294873 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 294874 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 294875 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 294876 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 294879 Eukaryotic-type carbonic anhydrase 294884 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 294885 Fructose-bisphosphate aldolase class-I 294885 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 294887 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 294888 Mitosis protein DIM1 294888 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 294889 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 294891 Mitosis protein DIM1 294892 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 294893 ENV polyprotein (coat polyprotein) 294894 Eukaryotic initiation factor 1A 294895 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294896 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b 294897 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 294900 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a 294901 Uncharacterized ACR, COG1490 294902 Eukaryotic porin 294903 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294903 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294908 Phosphoglycerate kinase 294909 FMN-dependent dehydrogenase 294909 2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor 294909 Conserved region in glutamate synthase. This family represents a region of the glutamate synthase protein. This region is expressed as a separate subunit in the glutamate synthase alpha subunit from archaebacteria, or part of a large multidomain enzyme in 294909 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 294912 Translocation protein Sec62 294913 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 294916 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 294917 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 294922 Ribosomal protein S6e 294923 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 294924 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 294926 Fibronectin type III domain 294933 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 294933 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 294934 Carboxylesterase 294935 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 294941 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 294942 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 294946 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 294947 3'5'-cyclic nucleotide phosphodiesterase 294950 Phosphoglycerate kinase 294953 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 294954 Helix-loop-helix DNA-binding domain 294955 Ribosomal protein L21e 294958 Tropomyosin 294959 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f 294961 Calcium-activated potassium channel, beta subunit 294961 Calcium-activated potassium channel, beta subunit 294962 WD domain, G-beta repeat 294962 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 294966 Microtubule associated protein (MAP65/ASE1 family) 294968 Macrophage migration inhibitory factor (MIF) 294970 ENV polyprotein (coat polyprotein) 294972 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 294974 7 transmembrane receptor (rhodopsin family) 294976 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 294979 Adenosine-deaminase (editase) domain 294983 Galactoside-binding lectin 294985 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 294985 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 294987 Ribosomal protein L23 294989 Ribosomal protein L31e 294990 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 294991 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 294991 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 294995 Coenzyme A transferase 295000 Ribosomal L15 295004 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 295007 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 295008 Fructose-bisphosphate aldolase class-I 295010 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 295013 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295014 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295015 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295016 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 295018 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 295019 Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene 295022 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295023 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295024 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295025 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295026 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295027 Cadherin domain 295028 Ribosomal L18ae protein family 295030 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 295031 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 295034 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 295037 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 295038 Eukaryotic initiation factor 4E 295041 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 295043 Uncharacterized ACR, YggU family COG1872 295047 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 295050 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 295050 3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 295051 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 295052 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 295054 Helix-loop-helix DNA-binding domain 295057 Beige/BEACH domain 295065 Ribosomal protein S19e 295066 Zinc finger, C3HC4 type (RING finger) 295066 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 295066 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 295072 Carboxylesterase 295073 Carboxylesterase 295073 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 295074 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 295074 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 295075 Carboxylesterase 295076 Carboxylesterase 295077 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 295080 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 295081 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 295082 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 295084 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 295085 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 295085 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 295086 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 295088 GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. E 295089 7 transmembrane receptor (metabotropic glutamate family) 295090 7 transmembrane receptor (metabotropic glutamate family) 295091 7 transmembrane receptor (metabotropic glutamate family) 295092 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295093 7 transmembrane receptor (metabotropic glutamate family) 295094 7 transmembrane receptor (metabotropic glutamate family) 295096 7 transmembrane receptor (metabotropic glutamate family) 295098 Ribosomal protein L44 295099 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295099 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295101 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti 295103 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 295107 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 295107 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 295112 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 295112 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 295113 Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40 295114 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 295117 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 295117 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 295118 Ribosomal protein L19e 295122 GTPase of unknown function 295123 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 295124 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 295129 Eukaryotic porin 295130 WD domain, G-beta repeat 295131 Intermediate filament protein 295133 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 295133 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 295134 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple 295135 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295135 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295138 Ribosomal protein S5, C-terminal domain 295139 Hsp90 protein 295140 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 295140 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 295144 7 transmembrane receptor (rhodopsin family) 295154 Hsp90 protein 295156 Enolase, N-terminal domain 295156 Enolase, C-terminal TIM barrel domain 295156 Ribosomal protein S5, N-terminal domain 295159 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 295160 Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair 295162 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di 295162 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 295165 WD domain, G-beta repeat 295168 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 295169 Ribosomal protein L21e 295171 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 295172 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase 295173 N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere 295175 N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere 295179 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase 295180 N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere 295184 Serine hydroxymethyltransferase 295186 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 295189 Polyprenyl synthetase 295190 Spumavirus gag protein 295192 WD domain, G-beta repeat 295193 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 295195 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 295199 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295199 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295200 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295201 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295202 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 295202 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295204 S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 295204 S-adenosylmethionine synthetase, C-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 295207 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 295208 Ribosomal protein S5, C-terminal domain 295209 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 295211 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 295213 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 295214 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 295214 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 295219 AMP-binding enzyme 295220 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 295221 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 295222 Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins 295227 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 295228 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 295229 YjeF-related protein N-terminus 295230 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 295234 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyne 295239 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 295239 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 295240 Protein kinase domain 295242 Protein of unknown function DUF124. This prokaryotic protein family has no known function 295245 MtN3/saliva family. This family includes proteins such as drosophila saliva, MtN3 involved in root nodule development and a protein involved in activation and expression of recombination activation genes (RAGs). Although the molecular function of these pr 295249 Phosphoadenosine phosphosulfate reductase family. This domain is found in phosphoadenosine phosphosulfate (PAPS) reductase enzymes or PAPS sulfotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N typ 295250 Phosphotyrosine interaction domain (PTB/PID) 295251 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 295252 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 295254 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295254 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 295254 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295255 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295255 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295256 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295256 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295257 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 295258 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295258 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295259 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 295261 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 295262 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 295263 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 295268 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 295268 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 295269 Ribosomal L15 295271 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 295272 Core histone H2A/H2B/H3/H4 295273 Core histone H2A/H2B/H3/H4 295274 Core histone H2A/H2B/H3/H4 295274 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 295277 Core histone H2A/H2B/H3/H4 295278 Core histone H2A/H2B/H3/H4 295278 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 295280 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295280 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295284 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 295285 Extracellular link domain 295287 Intermediate filament protein 295288 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 295292 Protein kinase domain 295294 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 295295 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 295296 Ets-domain 295297 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 295298 HMG (high mobility group) box 295300 Enolase, C-terminal TIM barrel domain 295304 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 295304 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 295305 Histidine acid phosphatase 295307 ENV polyprotein (coat polyprotein) 295309 Disintegrin 295309 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 295309 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 295314 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 295316 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 295317 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 295319 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 295321 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 295322 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 295323 Filamin/ABP280 repeat 295324 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 295326 Mab-21 protein 295327 Helix-loop-helix DNA-binding domain 295329 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 295329 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 295330 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295330 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295332 Ribosomal L15 295333 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 295333 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 295333 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 295335 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 295339 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 295340 Ribosomal protein L10 295341 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 295342 ADP-ribosylation factor family 295342 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 295343 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 295344 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 295345 Ribosomal protein S6e 295346 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 295347 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 295349 Glycosyl hydrolases family 18 295350 Glycosyl hydrolases family 18 295351 Glycosyl hydrolases family 18 295352 Glycosyl hydrolases family 18 295353 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295353 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295355 7 transmembrane receptor (rhodopsin family) 295358 Ribosomal L29e protein family 295359 Ribosomal protein L34e 295360 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 295360 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 295361 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 295362 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 295362 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 295363 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 295367 Cadherin domain 295367 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 295367 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 295367 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 295370 WD domain, G-beta repeat 295373 Ribosomal protein L21e 295375 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 295376 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 295378 PH domain. PH stands for pleckstrin homology 295378 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 295379 Ribosomal protein S17 295380 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 295382 Laminin N-terminal (Domain VI) 295384 Conserved hypothetical protein 95 295385 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 295386 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295386 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295388 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str 295388 Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwee 295389 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 295390 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 295393 Ribosomal protein L19e 295395 RNA 3'-terminal phosphate cyclase 295400 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 295401 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 295403 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 295406 AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4- 295407 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295411 Ribonucleotide reductase, small chain 295412 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 295420 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295420 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295423 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295423 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295429 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 295430 Sulfotransferase protein 295435 Ribosomal protein S6e 295436 Ribosomal protein L19e 295437 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295439 Ribosomal protein L21e 295441 ENV polyprotein (coat polyprotein) 295442 ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PA 295443 ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PA 295446 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 295448 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 295449 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 295450 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 295452 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295452 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295453 Cyclophilin type peptidyl-prolyl cis-trans isomerase 295458 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 295460 Ribosomal protein S26e 295464 Ribosomal protein L31e 295468 Aminotransferase class-III 295470 Ribosomal L29e protein family 295471 Ribosomal protein S6e 295472 Ribosomal protein L21e 295473 Ribosomal L29e protein family 295475 Helix-loop-helix DNA-binding domain 295476 S-adenosyl-L-homocysteine hydrolase 295480 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 295481 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295487 Ribosomal L29e protein family 295491 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 295495 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 295496 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 295497 metallopeptidase family M24 295498 Zinc-binding dehydrogenase 295499 Zinc-binding dehydrogenase 295500 metallopeptidase family M24 295501 Cyclophilin type peptidyl-prolyl cis-trans isomerase 295501 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 295502 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 295503 Nucleoside diphosphate kinase 295505 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 295506 Eukaryotic-type carbonic anhydrase 295508 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295508 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295509 Zinc finger, C3HC4 type (RING finger) 295511 Death domain 295512 Ribosomal L39 protein 295518 SH2 domain 295523 Hr1 repeat 295523 Protein kinase domain 295523 Protein kinase C terminal domain 295524 Ribosomal protein S19e 295529 14-3-3 protein 295530 Protein kinase domain 295531 Protein kinase domain 295531 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 295532 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 295537 Eukaryotic porin 295538 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 295541 DNA mismatch repair proteins, mutS family 295541 MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p 295543 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f 295544 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 295545 Ribosomal protein L6 295549 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 295551 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 295553 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 295554 Ribosomal protein S5, C-terminal domain 295556 Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family 295561 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 295562 Ribosomal protein L34 295563 GTPase of unknown function 295566 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 295567 TNF(Tumor Necrosis Factor) family 295568 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 295570 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295572 ENV polyprotein (coat polyprotein) 295574 Ribosomal protein S19 295575 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 295576 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295577 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295577 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295578 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295580 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295580 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295581 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295581 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295582 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295585 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 295585 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 295586 Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains 295587 Ribosomal protein L31e 295588 ADP-ribosylation factor family 295588 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 295590 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 295597 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295597 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295600 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 295603 Dihydropyridine sensitive L-type calcium channel (Beta subunit) 295603 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 295604 Ribosomal L29e protein family 295609 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 295610 Cyclophilin type peptidyl-prolyl cis-trans isomerase 295610 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 295611 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 295612 TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s 295614 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 295616 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295616 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295617 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295617 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295620 Phosphorylase family. Members of this family include: purine nucleoside phosphorylase (PNP) Uridine phosphorylase (UdRPase) 5'-methylthioadenosine phosphorylase (MTA phosphorylase) 295628 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has 295630 Lectin C-type domain. This family includes both long and short form C-type 295632 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 295633 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 295635 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 295640 WD domain, G-beta repeat 295641 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 295641 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 295642 ENV polyprotein (coat polyprotein) 295643 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295643 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295644 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 295645 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 295648 ENV polyprotein (coat polyprotein) 295649 Shikimate kinase 295649 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 295650 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 295658 Ribosomal protein S7e 295660 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 295662 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 295664 Conserved hypothetical protein 95 295664 C-5 cytosine-specific DNA methylase 295664 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 295666 Protein kinase domain 295667 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 295669 Cytochrome b561 295670 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 295672 FKBP-type peptidyl-prolyl cis-trans isomerase 295673 FKBP-type peptidyl-prolyl cis-trans isomerase 295676 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 295681 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295681 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295682 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 295684 HMG (high mobility group) box 295686 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 295688 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 295690 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 295691 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 295691 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 295694 Eukaryotic ribosomal protein L18 295698 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 295698 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 295699 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 295700 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 295701 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 295702 Yippee putative zinc-binding protein 295703 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 295704 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 295705 Mitochondrial carrier protein 295706 Lectin C-type domain. This family includes both long and short form C-type 295708 7 transmembrane receptor (rhodopsin family) 295709 7 transmembrane receptor (rhodopsin family) 295710 7 transmembrane receptor (rhodopsin family) 295711 7 transmembrane receptor (rhodopsin family) 295712 7 transmembrane receptor (rhodopsin family) 295713 7 transmembrane receptor (rhodopsin family) 295714 Fatty acid desaturase 295715 7 transmembrane receptor (rhodopsin family) 295716 7 transmembrane receptor (rhodopsin family) 295717 7 transmembrane receptor (rhodopsin family) 295718 7 transmembrane receptor (rhodopsin family) 295719 7 transmembrane receptor (rhodopsin family) 295720 7 transmembrane receptor (rhodopsin family) 295721 7 transmembrane receptor (rhodopsin family) 295722 7 transmembrane receptor (rhodopsin family) 295723 7 transmembrane receptor (rhodopsin family) 295724 7 transmembrane receptor (rhodopsin family) 295725 7 transmembrane receptor (rhodopsin family) 295726 7 transmembrane receptor (rhodopsin family) 295727 7 transmembrane receptor (rhodopsin family) 295728 7 transmembrane receptor (rhodopsin family) 295729 7 transmembrane receptor (rhodopsin family) 295730 7 transmembrane receptor (rhodopsin family) 295731 Ribosomal protein S19 295732 7 transmembrane receptor (rhodopsin family) 295733 7 transmembrane receptor (rhodopsin family) 295734 7 transmembrane receptor (rhodopsin family) 295735 7 transmembrane receptor (rhodopsin family) 295736 7 transmembrane receptor (rhodopsin family) 295737 7 transmembrane receptor (rhodopsin family) 295738 7 transmembrane receptor (rhodopsin family) 295739 7 transmembrane receptor (rhodopsin family) 295740 7 transmembrane receptor (rhodopsin family) 295741 7 transmembrane receptor (rhodopsin family) 295742 7 transmembrane receptor (rhodopsin family) 295743 7 transmembrane receptor (rhodopsin family) 295744 7 transmembrane receptor (rhodopsin family) 295745 7 transmembrane receptor (rhodopsin family) 295746 7 transmembrane receptor (rhodopsin family) 295747 7 transmembrane receptor (rhodopsin family) 295748 7 transmembrane receptor (rhodopsin family) 295749 7 transmembrane receptor (rhodopsin family) 295750 7 transmembrane receptor (rhodopsin family) 295751 7 transmembrane receptor (rhodopsin family) 295752 7 transmembrane receptor (rhodopsin family) 295753 7 transmembrane receptor (rhodopsin family) 295754 7 transmembrane receptor (rhodopsin family) 295755 7 transmembrane receptor (rhodopsin family) 295756 7 transmembrane receptor (rhodopsin family) 295757 7 transmembrane receptor (rhodopsin family) 295758 7 transmembrane receptor (rhodopsin family) 295759 7 transmembrane receptor (rhodopsin family) 295760 7 transmembrane receptor (rhodopsin family) 295761 7 transmembrane receptor (rhodopsin family) 295762 7 transmembrane receptor (rhodopsin family) 295763 7 transmembrane receptor (rhodopsin family) 295764 7 transmembrane receptor (rhodopsin family) 295765 7 transmembrane receptor (rhodopsin family) 295765 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 295766 7 transmembrane receptor (rhodopsin family) 295767 7 transmembrane receptor (rhodopsin family) 295768 7 transmembrane receptor (rhodopsin family) 295769 7 transmembrane receptor (rhodopsin family) 295770 7 transmembrane receptor (rhodopsin family) 295771 7 transmembrane receptor (rhodopsin family) 295772 7 transmembrane receptor (rhodopsin family) 295773 7 transmembrane receptor (rhodopsin family) 295774 7 transmembrane receptor (rhodopsin family) 295775 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 295776 7 transmembrane receptor (rhodopsin family) 295777 7 transmembrane receptor (rhodopsin family) 295778 7 transmembrane receptor (rhodopsin family) 295778 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 295779 7 transmembrane receptor (rhodopsin family) 295780 7 transmembrane receptor (rhodopsin family) 295781 7 transmembrane receptor (rhodopsin family) 295782 7 transmembrane receptor (rhodopsin family) 295783 7 transmembrane receptor (rhodopsin family) 295784 7 transmembrane receptor (rhodopsin family) 295785 7 transmembrane receptor (rhodopsin family) 295786 7 transmembrane receptor (rhodopsin family) 295787 7 transmembrane receptor (rhodopsin family) 295788 7 transmembrane receptor (rhodopsin family) 295789 7 transmembrane receptor (rhodopsin family) 295790 7 transmembrane receptor (rhodopsin family) 295791 7 transmembrane receptor (rhodopsin family) 295792 7 transmembrane receptor (rhodopsin family) 295793 7 transmembrane receptor (rhodopsin family) 295795 7 transmembrane receptor (rhodopsin family) 295797 7 transmembrane receptor (rhodopsin family) 295798 7 transmembrane receptor (rhodopsin family) 295799 7 transmembrane receptor (rhodopsin family) 295800 7 transmembrane receptor (rhodopsin family) 295802 7 transmembrane receptor (rhodopsin family) 295803 7 transmembrane receptor (rhodopsin family) 295806 7 transmembrane receptor (rhodopsin family) 295807 7 transmembrane receptor (rhodopsin family) 295808 7 transmembrane receptor (rhodopsin family) 295809 7 transmembrane receptor (rhodopsin family) 295810 Actin 295811 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 295812 7 transmembrane receptor (rhodopsin family) 295813 7 transmembrane receptor (rhodopsin family) 295814 7 transmembrane receptor (rhodopsin family) 295815 7 transmembrane receptor (rhodopsin family) 295816 7 transmembrane receptor (rhodopsin family) 295817 7 transmembrane receptor (rhodopsin family) 295818 7 transmembrane receptor (rhodopsin family) 295819 7 transmembrane receptor (rhodopsin family) 295820 7 transmembrane receptor (rhodopsin family) 295821 7 transmembrane receptor (rhodopsin family) 295822 7 transmembrane receptor (rhodopsin family) 295823 7 transmembrane receptor (rhodopsin family) 295824 7 transmembrane receptor (rhodopsin family) 295825 7 transmembrane receptor (rhodopsin family) 295826 7 transmembrane receptor (rhodopsin family) 295827 7 transmembrane receptor (rhodopsin family) 295828 7 transmembrane receptor (rhodopsin family) 295829 7 transmembrane receptor (rhodopsin family) 295830 7 transmembrane receptor (rhodopsin family) 295831 7 transmembrane receptor (rhodopsin family) 295832 7 transmembrane receptor (rhodopsin family) 295833 7 transmembrane receptor (rhodopsin family) 295834 7 transmembrane receptor (rhodopsin family) 295835 7 transmembrane receptor (rhodopsin family) 295836 7 transmembrane receptor (rhodopsin family) 295837 7 transmembrane receptor (rhodopsin family) 295838 7 transmembrane receptor (rhodopsin family) 295839 7 transmembrane receptor (rhodopsin family) 295840 7 transmembrane receptor (rhodopsin family) 295841 7 transmembrane receptor (rhodopsin family) 295842 7 transmembrane receptor (rhodopsin family) 295843 7 transmembrane receptor (rhodopsin family) 295844 7 transmembrane receptor (rhodopsin family) 295845 7 transmembrane receptor (rhodopsin family) 295848 7 transmembrane receptor (rhodopsin family) 295849 7 transmembrane receptor (rhodopsin family) 295850 7 transmembrane receptor (rhodopsin family) 295851 7 transmembrane receptor (rhodopsin family) 295852 7 transmembrane receptor (rhodopsin family) 295852 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 295853 7 transmembrane receptor (rhodopsin family) 295854 7 transmembrane receptor (rhodopsin family) 295855 7 transmembrane receptor (rhodopsin family) 295856 7 transmembrane receptor (rhodopsin family) 295857 7 transmembrane receptor (rhodopsin family) 295858 7 transmembrane receptor (rhodopsin family) 295859 7 transmembrane receptor (rhodopsin family) 295860 7 transmembrane receptor (rhodopsin family) 295861 7 transmembrane receptor (rhodopsin family) 295862 7 transmembrane receptor (rhodopsin family) 295863 7 transmembrane receptor (rhodopsin family) 295864 7 transmembrane receptor (rhodopsin family) 295865 7 transmembrane receptor (rhodopsin family) 295866 7 transmembrane receptor (rhodopsin family) 295867 7 transmembrane receptor (rhodopsin family) 295868 7 transmembrane receptor (rhodopsin family) 295869 7 transmembrane receptor (rhodopsin family) 295870 7 transmembrane receptor (rhodopsin family) 295871 7 transmembrane receptor (rhodopsin family) 295872 7 transmembrane receptor (rhodopsin family) 295873 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 295874 7 transmembrane receptor (rhodopsin family) 295875 7 transmembrane receptor (rhodopsin family) 295876 7 transmembrane receptor (rhodopsin family) 295877 7 transmembrane receptor (rhodopsin family) 295878 7 transmembrane receptor (rhodopsin family) 295878 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 295879 7 transmembrane receptor (rhodopsin family) 295880 7 transmembrane receptor (rhodopsin family) 295881 7 transmembrane receptor (rhodopsin family) 295882 7 transmembrane receptor (rhodopsin family) 295883 7 transmembrane receptor (rhodopsin family) 295884 7 transmembrane receptor (rhodopsin family) 295885 7 transmembrane receptor (rhodopsin family) 295886 7 transmembrane receptor (rhodopsin family) 295887 7 transmembrane receptor (rhodopsin family) 295889 7 transmembrane receptor (rhodopsin family) 295890 7 transmembrane receptor (rhodopsin family) 295891 7 transmembrane receptor (rhodopsin family) 295892 7 transmembrane receptor (rhodopsin family) 295893 7 transmembrane receptor (rhodopsin family) 295894 7 transmembrane receptor (rhodopsin family) 295895 7 transmembrane receptor (rhodopsin family) 295896 7 transmembrane receptor (rhodopsin family) 295897 7 transmembrane receptor (rhodopsin family) 295898 7 transmembrane receptor (rhodopsin family) 295900 7 transmembrane receptor (rhodopsin family) 295901 7 transmembrane receptor (rhodopsin family) 295902 7 transmembrane receptor (rhodopsin family) 295903 7 transmembrane receptor (rhodopsin family) 295904 7 transmembrane receptor (rhodopsin family) 295905 7 transmembrane receptor (rhodopsin family) 295906 7 transmembrane receptor (rhodopsin family) 295907 7 transmembrane receptor (rhodopsin family) 295908 7 transmembrane receptor (rhodopsin family) 295909 7 transmembrane receptor (rhodopsin family) 295910 7 transmembrane receptor (rhodopsin family) 295911 7 transmembrane receptor (rhodopsin family) 295912 7 transmembrane receptor (rhodopsin family) 295913 7 transmembrane receptor (rhodopsin family) 295914 7 transmembrane receptor (rhodopsin family) 295915 7 transmembrane receptor (rhodopsin family) 295916 7 transmembrane receptor (rhodopsin family) 295917 7 transmembrane receptor (rhodopsin family) 295918 7 transmembrane receptor (rhodopsin family) 295919 7 transmembrane receptor (rhodopsin family) 295920 7 transmembrane receptor (rhodopsin family) 295921 7 transmembrane receptor (rhodopsin family) 295922 Mitochondrial carrier protein 295923 Respiratory-chain NADH dehydrogenase, 30 Kd subunit 295924 Elongation factor 1 gamma, conserved domain 295924 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 295925 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 295928 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 295929 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 295934 Sulfotransferase protein 295935 Cyclophilin type peptidyl-prolyl cis-trans isomerase 295938 OAR domain 295939 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 295939 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 295941 Ribosomal protein L21e 295942 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps 295943 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 295944 Hsp90 protein 295946 Hsp90 protein 295949 Ribosomal L15 295950 Enolase, C-terminal TIM barrel domain 295951 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 295953 Recombination activating protein 2. V-D-J recombination is the combinatorial process by which the huge range of immunoglobulin and T cell binding specificity is generated from a limited amount of genetic material. This process is synergistically activated 295954 Zinc finger, C3HC4 type (RING finger) 295958 Ribosomal protein L21e 295961 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 295964 Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function 295965 Ets-domain 295966 Ets-domain 295966 Sterile alpha motif (SAM)/Pointed domain 295969 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 295972 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 295973 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 295974 Ribosomal protein L35Ae 295975 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 295977 Ribosomal protein L6e 295978 4-hydroxybenzoyl-CoA thioesterase. This enzyme (EC 3.1.2.23) catalyses the final step in the biosynthesis of 4-hydroxybenzoate from 4-chlorobenzoate in the soil dwelling microbe Pseudomonas CBS-3 295979 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 295980 Doublecortin 295981 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 295984 Ribosomal protein S6e 295985 MraW methylase family. Members of this family are probably SAM dependent methyltransferases. This family appears to be related to pfam01596 295986 SH2 domain 295987 MraW methylase family. Members of this family are probably SAM dependent methyltransferases. This family appears to be related to pfam01596 295988 Kinesin motor domain 295989 S-adenosyl-L-homocysteine hydrolase 295990 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 295990 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 295991 7 transmembrane receptor (rhodopsin family) 295993 Mitochondrial carrier protein 295996 Ribosomal protein L19e 295999 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 296000 7 transmembrane receptor (rhodopsin family) 296001 7 transmembrane receptor (rhodopsin family) 296002 7 transmembrane receptor (rhodopsin family) 296003 7 transmembrane receptor (rhodopsin family) 296004 7 transmembrane receptor (rhodopsin family) 296005 7 transmembrane receptor (rhodopsin family) 296007 Ribosomal protein S8e 296008 7 transmembrane receptor (rhodopsin family) 296009 7 transmembrane receptor (rhodopsin family) 296010 7 transmembrane receptor (rhodopsin family) 296011 7 transmembrane receptor (rhodopsin family) 296012 7 transmembrane receptor (rhodopsin family) 296013 7 transmembrane receptor (rhodopsin family) 296014 7 transmembrane receptor (rhodopsin family) 296015 7 transmembrane receptor (rhodopsin family) 296016 7 transmembrane receptor (rhodopsin family) 296017 7 transmembrane receptor (rhodopsin family) 296018 7 transmembrane receptor (rhodopsin family) 296019 7 transmembrane receptor (rhodopsin family) 296020 7 transmembrane receptor (rhodopsin family) 296021 7 transmembrane receptor (rhodopsin family) 296022 7 transmembrane receptor (rhodopsin family) 296023 7 transmembrane receptor (rhodopsin family) 296024 7 transmembrane receptor (rhodopsin family) 296025 7 transmembrane receptor (rhodopsin family) 296026 7 transmembrane receptor (rhodopsin family) 296027 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296028 7 transmembrane receptor (rhodopsin family) 296029 7 transmembrane receptor (rhodopsin family) 296031 7 transmembrane receptor (rhodopsin family) 296032 7 transmembrane receptor (rhodopsin family) 296033 7 transmembrane receptor (rhodopsin family) 296034 7 transmembrane receptor (rhodopsin family) 296035 7 transmembrane receptor (rhodopsin family) 296037 7 transmembrane receptor (rhodopsin family) 296038 7 transmembrane receptor (rhodopsin family) 296039 7 transmembrane receptor (rhodopsin family) 296040 7 transmembrane receptor (rhodopsin family) 296041 7 transmembrane receptor (rhodopsin family) 296042 7 transmembrane receptor (rhodopsin family) 296043 7 transmembrane receptor (rhodopsin family) 296044 7 transmembrane receptor (rhodopsin family) 296045 7 transmembrane receptor (rhodopsin family) 296046 7 transmembrane receptor (rhodopsin family) 296047 7 transmembrane receptor (rhodopsin family) 296054 Ribosomal protein L35Ae 296056 Zinc-binding dehydrogenase 296057 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 296059 Formin Homology 2 Domain 296060 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 296061 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 296063 Ribosomal protein L13e 296064 Actin 296070 Ribosomal L39 protein 296071 Enolase, N-terminal domain 296071 Enolase, C-terminal TIM barrel domain 296072 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 296075 Protein kinase domain 296076 Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding 296077 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 296078 Protein kinase domain 296078 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 296079 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 296082 Helix-hairpin-helix motif 296082 recA bacterial DNA recombination protein 296083 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 296084 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 296088 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 296088 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 296091 C2 domain 296091 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 296094 Putative zinc finger in N-recognin 296096 Transglutaminase family, C-terminal ig like domain 296099 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 296102 Occludin/ELL family 296104 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 296106 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 296111 Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold 296111 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 296112 S-adenosyl-L-homocysteine hydrolase 296116 GHMP kinases putative ATP-binding protein 296117 GHMP kinases putative ATP-binding protein 296119 DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after 296121 Ubiquitin carboxyl-terminal hydrolase family 2 296122 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 296124 Ribosomal protein S11 296126 pfam02889, Sec63, Sec63 domain 296129 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family 296133 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296134 Ribosomal protein S5, C-terminal domain 296138 Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold 296139 Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA 296139 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 296140 Ribosomal protein S17 296143 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296143 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296150 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 296152 Transglutaminase family 296152 Transglutaminase family, C-terminal ig like domain 296152 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 296156 Zinc carboxypeptidase 296157 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296165 Ribosomal protein L11, RNA binding domain 296165 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 296167 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoform 296169 Proteasome A-type and B-type 296171 Ribosomal S3Ae family 296172 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 296173 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 296175 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 296176 HMG (high mobility group) box 296182 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 296186 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296186 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296190 ubiE/COQ5 methyltransferase family 296190 Putative methyltransferase. Members of this family of hypothetical plant proteins are probably methyltransferases: several of the aligned sequences either match methyltransferase profiles, or contain a SAM-binding motif. One member contains both 296190 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple 296190 SAM dependent carboxyl methyltransferase. This family of plant methyltransferases contains enzymes that act on a variety of substrates including salicylic acid, jasmonic acid and 7-Methylxanthine. Caffeine is synthesized through sequential three-step meth 296196 Ribosomal protein S6e 296197 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 296198 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 296203 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 296204 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296204 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296206 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 296209 Ribosomal L10 296210 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 296215 Eukaryotic ribosomal protein L18 296217 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 296218 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 296219 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 296220 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296221 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296222 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296224 Translation initiation factor SUI1 296225 Translation initiation factor SUI1 296227 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 296228 Peptidase family M49 296231 Translation initiation factor SUI1 296232 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296234 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296235 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296237 Ribosomal protein L13e 296240 Apoptosis regulator proteins, Bcl-2 family 296244 GHMP kinases putative ATP-binding protein 296245 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 296246 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296247 TEA/ATTS domain family 296248 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296249 Core histone H2A/H2B/H3/H4 296250 Protein kinase domain 296252 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 296255 GHMP kinases putative ATP-binding protein 296256 Protein kinase domain 296257 Ribosomal protein S2 296257 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 296258 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296260 Ribosomal protein S19e 296261 AMP-binding enzyme 296262 Ribosomal protein S2 296265 Protein kinase domain 296266 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 296266 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 296266 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has 296269 Fibrinogen beta and gamma chains, C-terminal globular domain 296272 Helix-loop-helix DNA-binding domain 296276 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 296279 Endomembrane protein 70 296280 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 296282 Ribosomal protein L15 296283 Ribosomal protein L21e 296286 Enolase, C-terminal TIM barrel domain 296287 EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the 296290 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 296292 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 296296 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 296297 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 296298 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 296299 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 296299 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 296299 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 296301 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 296302 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 296304 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 296305 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 296305 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 296307 C2 domain 296310 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 296313 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 296314 Homeobox domain 296316 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 296317 Guanylate-kinase-associated protein (GKAP) protein 296318 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to 296321 LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 296321 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 296323 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 296325 Ribosomal protein L13e 296326 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 296331 Protein-tyrosine phosphatase 296332 Protein-tyrosine phosphatase 296333 Protein-tyrosine phosphatase 296336 Fibronectin type III domain 296339 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 296341 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 296341 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 296345 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 296349 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 296349 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 296350 TMS membrane protein/tumour differentially expressed protein (TDE) 296351 Poly-adenylate binding protein, unique domain 296351 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 296356 WAP-type (Whey Acidic Protein) 'four-disulfide core' 296357 WAP-type (Whey Acidic Protein) 'four-disulfide core' 296358 von Willebrand factor type A domain 296360 Gpi16 subunit, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to new 296362 Adenylate kinase 296365 Ribosomal protein L13e 296368 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 296369 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 296370 Serine carboxypeptidase 296371 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 296377 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 296381 CAS/CSE protein, C-terminus. Mammalian cellular apoptosis susceptibility (CAS) proteins are homologous to the yeast chromosome-segregation protein, CSE1. This family aligns the C-terminal halves (approximately). CAS is involved in both cellular apoptosis 296385 pfam02939, UcrQ, UcrQ family. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This family represents the 9.5 kDa subunit of the complex 296386 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 296389 Eukaryotic ribosomal protein L18 296390 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 296394 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 296395 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 296395 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 296401 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 296402 PTB domain (IRS-1 type) 296406 Ribosomal protein L15 296407 Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p4 296409 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 296412 Transcription factor AP-2 296413 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 296416 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 296417 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 296420 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 296420 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 296421 Ubiquitin carboxyl-terminal hydrolase, family 1 296423 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296424 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296425 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296426 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296427 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296428 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296429 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296431 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296432 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296433 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296434 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296435 Ribosomal protein L24e 296436 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296437 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296438 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296440 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296441 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296444 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296445 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 296446 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296447 Stathmin family 296448 Ribosomal protein S19 296451 Ribosomal protein L19e 296453 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296453 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296455 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 296461 Oxysterol-binding protein 296462 GTP1/OBG family 296462 GTPase of unknown function 296462 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 296467 YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f 296471 Protein kinase domain 296472 SH2 domain 296472 Protein kinase domain 296472 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 296473 Ribosomal protein L35Ae 296474 RNB-like protein. The function of this region of similarity is uncertain 296478 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 296480 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a 296485 RNA polymerases M/15 Kd subunit 296485 Transcription factor S-II (TFIIS) 296489 Ribosomal protein L6 296492 Enolase, N-terminal domain 296492 Enolase, C-terminal TIM barrel domain 296492 Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain 296497 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 296498 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 296499 Homeobox domain 296502 Ribosomal protein L21e 296503 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 296503 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 296504 Glycosyltransferase family 6 296505 Cadherin domain 296506 Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va 296508 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 296510 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 296511 OAR domain 296511 Homeobox domain 296513 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 296515 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296516 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 296517 Helix-loop-helix DNA-binding domain 296518 ENV polyprotein (coat polyprotein) 296519 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 296521 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296522 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296524 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 296525 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 296526 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 296526 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 296532 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 296532 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 296533 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 296534 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 296536 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296536 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296537 Transcription factor TFIID (or TATA-binding protein, TBP) 296539 Transcription factor S-II (TFIIS) 296540 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 296541 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 296542 Ribosomal protein L23 296545 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296546 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296547 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296548 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296553 Ribosomal protein L14p/L23e 296554 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 296554 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 296558 Cullin family 296559 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 296560 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid 296563 Ribosomal L28e protein family 296564 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10 296566 Taurine catabolism dioxygenase TauD, TfdA family. This family consists of taurine catabolism dioxygenases of the TauD, TfdA family. TauD from E. coli is a alpha-ketoglutarate-dependent taurine dioxygenase. This enzyme catalyses the oxygenolytic release of 296568 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296570 Helix-turn-helix. This large family of DNA binding helix-turn helix proteins includes Cro and CI 296573 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296574 Eukaryotic initiation factor 4E 296574 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296575 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296576 Eukaryotic initiation factor 4E 296576 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296577 Glycosyltransferase family 6 296578 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296581 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 296582 Ribosomal protein S5, C-terminal domain 296582 Ribosomal protein S5, N-terminal domain 296586 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 296588 Insulinase (Peptidase family M16) 296592 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296593 Aminotransferase class I and II 296594 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296595 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296598 SURF4 family 296603 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 296605 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 296614 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 296619 Protein kinase domain 296620 DNA/RNA non-specific endonuclease 296624 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 296625 Choline/Carnitine o-acyltransferase 296625 Major royal jelly protein. Royal jelly is the food of queen bee larvae, and is responsible for the high reproductive ability of the queen. Major royal jelly proteins make up around 90% of larval jelly proteins. This family also the sequence-related yellow 296627 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 296630 Cyclophilin type peptidyl-prolyl cis-trans isomerase 296635 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 296641 Peptidyl-tRNA hydrolase 296644 Ribosomal protein L11, RNA binding domain 296644 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 296645 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 296646 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 296646 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 296647 Ribosomal L39 protein 296654 Gelsolin repeat 296655 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 296656 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 296657 Glycosyltransferase family 6 296659 NAC domain 296660 7 transmembrane receptor (rhodopsin family) 296661 7 transmembrane receptor (rhodopsin family) 296662 7 transmembrane receptor (rhodopsin family) 296663 7 transmembrane receptor (rhodopsin family) 296664 7 transmembrane receptor (rhodopsin family) 296665 7 transmembrane receptor (rhodopsin family) 296666 7 transmembrane receptor (rhodopsin family) 296667 7 transmembrane receptor (rhodopsin family) 296669 7 transmembrane receptor (rhodopsin family) 296670 7 transmembrane receptor (rhodopsin family) 296672 7 transmembrane receptor (rhodopsin family) 296673 7 transmembrane receptor (rhodopsin family) 296674 7 transmembrane receptor (rhodopsin family) 296675 7 transmembrane receptor (rhodopsin family) 296676 7 transmembrane receptor (rhodopsin family) 296677 7 transmembrane receptor (rhodopsin family) 296678 7 transmembrane receptor (rhodopsin family) 296679 7 transmembrane receptor (rhodopsin family) 296680 7 transmembrane receptor (rhodopsin family) 296681 7 transmembrane receptor (rhodopsin family) 296682 7 transmembrane receptor (rhodopsin family) 296683 7 transmembrane receptor (rhodopsin family) 296684 7 transmembrane receptor (rhodopsin family) 296685 7 transmembrane receptor (rhodopsin family) 296686 7 transmembrane receptor (rhodopsin family) 296687 7 transmembrane receptor (rhodopsin family) 296688 7 transmembrane receptor (rhodopsin family) 296688 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 296689 7 transmembrane receptor (rhodopsin family) 296690 7 transmembrane receptor (rhodopsin family) 296691 7 transmembrane receptor (rhodopsin family) 296692 7 transmembrane receptor (rhodopsin family) 296693 7 transmembrane receptor (rhodopsin family) 296694 7 transmembrane receptor (rhodopsin family) 296695 Nucleoside diphosphate kinase 296697 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 296698 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 296699 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 296702 Nucleoside diphosphate kinase 296703 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 296704 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 296706 Homeobox domain 296706 LIM domain. This family represents two copies of the LIM structural domain 296707 7 transmembrane receptor (Secretin family) 296708 Olfactomedin-like domain 296711 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 296712 ENV polyprotein (coat polyprotein) 296713 Ribosomal L10 296714 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 296717 Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine 296717 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296717 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 296718 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 296721 HMG (high mobility group) box 296723 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 296724 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 296730 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 296734 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 296734 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 296735 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 296735 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 296739 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 296744 Autophagy protein Apg9. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targetin 296748 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 296752 Ribosomal protein L21e 296754 Macrophage migration inhibitory factor (MIF) 296764 Guanylate kinase 296767 Intermediate filament protein 296767 Intermediate filament tail domain 296768 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 296770 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 296772 ENV polyprotein (coat polyprotein) 296773 Ribosomal protein L19e 296774 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 296775 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 296786 CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion 296787 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 296788 Cadherin domain 296789 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 296790 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 296791 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296791 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296804 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 296806 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296807 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 296813 Mitochondrial carrier protein 296817 RanBP1 domain 296821 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 296822 Proteasome A-type and B-type 296823 HMG (high mobility group) box 296826 Protein kinase domain 296827 Protein kinase domain 296838 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 296846 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 296849 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 296851 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H 296852 Gelsolin repeat 296852 Villin headpiece domain 296853 Ribosomal L29e protein family 296854 Ribosomal L29e protein family 296855 Mitochondrial carrier protein 296856 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 296857 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296857 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296858 Ribosomal L29e protein family 296859 Lyase 296860 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 296862 Guanylate-kinase-associated protein (GKAP) protein 296866 7 transmembrane receptor (rhodopsin family) 296867 7 transmembrane receptor (rhodopsin family) 296868 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296868 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296869 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296869 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296870 Ribosomal protein L34e 296874 7 transmembrane receptor (Secretin family) 296874 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296875 Enolase, N-terminal domain 296875 Enolase, C-terminal TIM barrel domain 296876 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 296877 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 296878 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 296879 Ribosomal protein S2 296880 Ribosomal protein L31e 296882 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296882 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 296886 Actin 296889 HMG14 and HMG17 296891 GTPase of unknown function 296891 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 296891 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 296893 Stathmin family 296894 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 296896 Ribosomal protein S5, N-terminal domain 296897 Thrombospondin type 1 domain 296898 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 296903 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 296904 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 296905 Ribosomal protein L21e 296907 LIM domain. This family represents two copies of the LIM structural domain 296910 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 296911 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain 296912 wnt family 296913 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 296915 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 296918 Nucleoside diphosphate kinase 296919 Tetraspanin family 296923 Translationally controlled tumor protein 296925 Saccharopine dehydrogenase. This family comprised of three structural domains that can not be separated in the linear sequence. In some organisms this enzyme is found as a bifunctional polypeptide with lysine ketoglutarate reductase (PF). The saccharopine 296926 Ribosomal protein S5, C-terminal domain 296930 Zinc finger, C3HC4 type (RING finger) 296930 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 296932 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 296936 WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin 296937 Hyaluronidase 296938 Hyaluronidase 296939 Hyaluronidase 296940 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 296944 Thymidine kinase 296948 Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species 296949 Ribosomal protein L35Ae 296951 Filamin/ABP280 repeat 296955 Phosphoglycerate kinase 296956 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 296959 Zinc carboxypeptidase 296960 Zinc carboxypeptidase 296960 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol 296964 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 296967 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 296968 Guanylate kinase 296968 Minor capsid protein VI. This minor capsid protein may act as a link between the external capsid and the internal DNA-protein core. The C-terminal 11 residues may function as a protease cofactor leading to enzyme activation 296970 Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix bu 296972 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 296973 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 296980 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 296981 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 296985 Lectin C-type domain. This family includes both long and short form C-type 296987 HMG (high mobility group) box 296991 Protein kinase domain 296993 Protein kinase domain 296996 Protein kinase domain 296998 7 transmembrane receptor (rhodopsin family) 296999 7 transmembrane receptor (rhodopsin family) 297000 Lectin C-type domain. This family includes both long and short form C-type 297002 Trypsin 297003 Trypsin 297004 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297005 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297009 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297010 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297012 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297015 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297016 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297017 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297019 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297020 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297022 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297025 Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be 297027 7 transmembrane receptor (rhodopsin family) 297029 2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HC 297029 2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HCC 297031 LIM domain. This family represents two copies of the LIM structural domain 297034 7 transmembrane receptor (rhodopsin family) 297035 7 transmembrane receptor (rhodopsin family) 297036 7 transmembrane receptor (rhodopsin family) 297037 7 transmembrane receptor (rhodopsin family) 297039 7 transmembrane receptor (rhodopsin family) 297040 7 transmembrane receptor (rhodopsin family) 297041 7 transmembrane receptor (rhodopsin family) 297042 7 transmembrane receptor (rhodopsin family) 297043 7 transmembrane receptor (rhodopsin family) 297044 7 transmembrane receptor (rhodopsin family) 297045 7 transmembrane receptor (rhodopsin family) 297046 7 transmembrane receptor (rhodopsin family) 297047 7 transmembrane receptor (rhodopsin family) 297048 7 transmembrane receptor (rhodopsin family) 297049 7 transmembrane receptor (rhodopsin family) 297050 7 transmembrane receptor (rhodopsin family) 297051 7 transmembrane receptor (rhodopsin family) 297053 Homeobox domain 297054 Calreticulin family 297056 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 297058 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 297063 Ribosomal protein L44 297064 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 297066 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 297066 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 297068 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297069 von Willebrand factor type D domain 297072 Thrombospondin type 1 domain 297072 von Willebrand factor type D domain 297072 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 297076 GTPase of unknown function 297076 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 297079 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 297081 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge 297082 Domain of unknown function DUF143. This domain has no known function nor do any of the proteins that possess it. The aligned region is approximately 100 amino acids long 297083 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 297084 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 297086 Ribosomal protein S17 297087 Protein kinase domain 297088 Animal haem peroxidase 297088 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 297089 Ribosomal protein L31e 297090 Protein kinase domain 297093 'chromo' (CHRromatin Organization MOdifier) domain 297093 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 297094 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 297095 MAS20 protein import receptor 297096 PX domain. PX domains bind to phosphoinositides 297097 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297098 Ribosomal protein L31e 297099 Homeobox domain 297100 Homeobox domain 297102 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 297103 ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold 297103 ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain 297112 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 297112 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 297113 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 297114 NNMT/PNMT/TEMT family 297116 GrpE 297117 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 297118 Ribosomal protein S2 297119 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297120 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297121 7 transmembrane receptor (rhodopsin family) 297121 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297122 7 transmembrane receptor (rhodopsin family) 297122 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297123 FKBP-type peptidyl-prolyl cis-trans isomerase 297124 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297125 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297125 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297126 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297127 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297129 7 transmembrane receptor (rhodopsin family) 297129 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297130 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297131 7 transmembrane receptor (rhodopsin family) 297131 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297132 Hsp90 protein 297133 Hsp90 protein 297133 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297134 7 transmembrane receptor (rhodopsin family) 297134 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297135 Ribosomal protein S6e 297136 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297137 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297138 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 297139 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297140 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297141 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297142 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297143 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297144 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297145 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297146 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297147 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297150 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 297151 Ribosomal protein S7e 297153 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 297154 Cyclophilin type peptidyl-prolyl cis-trans isomerase 297155 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 297161 Enolase, N-terminal domain 297161 Enolase, C-terminal TIM barrel domain 297164 Ribosomal L29e protein family 297165 HMG (high mobility group) box 297167 Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation 297168 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 297170 7 transmembrane receptor (rhodopsin family) 297173 ADP-ribosylation factor family 297173 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 297174 'Cold-shock' DNA-binding domain 297175 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 297176 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 297177 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297178 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297180 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297182 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297183 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297185 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297187 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297189 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297189 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 297190 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297191 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297195 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 297197 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297202 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297203 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297204 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297207 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297208 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297209 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297211 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297213 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297216 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297217 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297218 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297219 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297220 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297221 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297224 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297225 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297226 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297228 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297229 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297230 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297231 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297232 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297234 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297235 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297236 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297239 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297240 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297243 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297247 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297248 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297249 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297254 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297256 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297258 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297259 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297260 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297261 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297262 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297263 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297264 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297268 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297270 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297272 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297274 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297279 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297280 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297281 Fibrillarin 297282 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297285 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297287 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297288 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297289 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 297290 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297291 Fibrillarin 297292 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297294 HMG (high mobility group) box 297294 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297300 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297304 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297305 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297306 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297307 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297308 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297309 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297311 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297312 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297314 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297316 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297317 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297318 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297321 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297322 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297323 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297324 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297325 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297326 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297328 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297330 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297331 Fork head domain 297336 Ubiquitin carboxyl-terminal hydrolase family 2 297337 Zinc finger, C3HC4 type (RING finger) 297338 SH2 domain 297339 Gelsolin repeat 297345 ENV polyprotein (coat polyprotein) 297346 7 transmembrane receptor (rhodopsin family) 297348 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 297352 HMG (high mobility group) box 297353 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 297354 Ribosomal protein L15 297356 Ribosomal protein S8 297357 Vinculin family 297358 Vinculin family 297359 Vinculin family 297360 Vinculin family 297362 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297363 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297363 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297365 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297366 Cyclophilin type peptidyl-prolyl cis-trans isomerase 297367 Sec1 family 297369 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 297369 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 297370 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 297371 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 297372 Ribosomal protein L19 297373 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 297374 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 297374 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 297376 Trypsin 297376 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 297377 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 297379 Homeobox domain 297386 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 297387 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc 297388 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 297389 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 297390 Ribosomal L29e protein family 297391 Lectin C-type domain. This family includes both long and short form C-type 297392 Protein of unknown function (DUF396). A family of conserved eukaryotic transmembrane proteins 297394 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 297395 Ribosomal protein S19 297399 Ribosomal protein L31e 297401 Protein kinase domain 297402 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 297404 Arginosuccinate synthase. This family contains a PP-loop motif 297405 Homeobox domain 297406 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 297413 Helix-loop-helix DNA-binding domain 297416 NifU-like domain. This is an alignment of the carboxy-terminal domain. This is the only common region between the NifU protein from nitrogen-fixing bacteria and rhodobacterial species. The biochemical function of NifU is unknown 297417 SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Presumably the SIS 297418 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 297419 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 297420 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 297421 Transforming growth factor beta like domain 297421 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 297423 PH domain. PH stands for pleckstrin homology 297424 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 297425 ADP-ribosylation factor family 297425 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 297428 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 297430 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 297432 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 297432 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 297437 Pyridine nucleotide-disulphide oxidoreductase, dimerisation domain. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases 297438 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297439 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297440 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297441 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297442 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297443 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 297448 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 297449 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 297452 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 297452 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 297455 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 297460 Ribosomal protein L10 297463 Guanylate kinase 297464 Dynein light chain type 1 297468 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 297469 Proteasome A-type and B-type 297470 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 297471 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 297472 ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity 297472 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 297473 NOL1/NOP2/sun family 297475 Ribosomal protein L35Ae 297478 Ribosomal protein L23 297479 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297479 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297480 Fork head domain 297481 Eukaryotic initiation factor 4E 297484 Actin 297487 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 297488 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 297489 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 297489 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 297491 Ribosomal protein L13e 297493 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 297494 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 297494 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 297495 Ribosomal protein L13e 297498 ATP-dependent protease La (LON) domain 297499 Ribosomal protein S7e 297500 Actin 297503 ADP-ribosylation factor family 297505 ENV polyprotein (coat polyprotein) 297506 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 297507 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 297513 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 297521 Death domain 297522 WD domain, G-beta repeat 297528 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 297532 Ribosomal protein L6 297534 Ribosomal protein L23 297536 7 transmembrane receptor (rhodopsin family) 297537 7 transmembrane receptor (rhodopsin family) 297538 7 transmembrane receptor (rhodopsin family) 297539 Hsp90 protein 297540 7 transmembrane receptor (rhodopsin family) 297542 7 transmembrane receptor (rhodopsin family) 297543 7 transmembrane receptor (rhodopsin family) 297544 7 transmembrane receptor (rhodopsin family) 297545 7 transmembrane receptor (rhodopsin family) 297546 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 297547 Ribosomal protein S6e 297548 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 297550 Enolase, N-terminal domain 297550 Enolase, C-terminal TIM barrel domain 297550 Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain 297553 ENV polyprotein (coat polyprotein) 297555 Cadherin domain 297555 Protein kinase domain 297556 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 297557 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 297558 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 297560 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 297563 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 297565 Ribosomal protein L21e 297566 ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit 297567 Eukaryotic porin 297568 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 297569 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 297571 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 297572 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 297574 Ribosomal protein S26e 297577 Transforming growth factor beta like domain 297577 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 297580 Homeobox domain 297581 Ribosomal L29e protein family 297583 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 297584 Lectin C-type domain. This family includes both long and short form C-type 297585 Lectin C-type domain. This family includes both long and short form C-type 297586 Lectin C-type domain. This family includes both long and short form C-type 297587 Lectin C-type domain. This family includes both long and short form C-type 297588 7 transmembrane receptor (metabotropic glutamate family) 297588 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 297589 7 transmembrane receptor (metabotropic glutamate family) 297590 7 transmembrane receptor (metabotropic glutamate family) 297597 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 297598 7 transmembrane receptor (rhodopsin family) 297602 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 297603 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 297603 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 297605 Ribosomal protein L23 297606 Cyclophilin type peptidyl-prolyl cis-trans isomerase 297608 Ribosomal protein S5, C-terminal domain 297610 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 297611 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 297611 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 297614 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297615 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297616 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297617 TEA/ATTS domain family 297617 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297618 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297619 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297620 TEA/ATTS domain family 297621 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297622 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297623 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297624 TEA/ATTS domain family 297624 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 297626 TEA/ATTS domain family 297629 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 297630 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297632 Lectin C-type domain. This family includes both long and short form C-type 297633 Lectin C-type domain. This family includes both long and short form C-type 297634 Lectin C-type domain. This family includes both long and short form C-type 297635 Lectin C-type domain. This family includes both long and short form C-type 297637 Lectin C-type domain. This family includes both long and short form C-type 297638 Ribosomal L29e protein family 297640 Lectin C-type domain. This family includes both long and short form C-type 297641 Lectin C-type domain. This family includes both long and short form C-type 297644 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 297644 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 297646 Lectin C-type domain. This family includes both long and short form C-type 297647 Lectin C-type domain. This family includes both long and short form C-type 297648 Lectin C-type domain. This family includes both long and short form C-type 297649 Lectin C-type domain. This family includes both long and short form C-type 297650 Lectin C-type domain. This family includes both long and short form C-type 297651 Lectin C-type domain. This family includes both long and short form C-type 297652 Lectin C-type domain. This family includes both long and short form C-type 297654 Lectin C-type domain. This family includes both long and short form C-type 297656 Lectin C-type domain. This family includes both long and short form C-type 297657 Lectin C-type domain. This family includes both long and short form C-type 297661 Lectin C-type domain. This family includes both long and short form C-type 297664 Lectin C-type domain. This family includes both long and short form C-type 297665 Lectin C-type domain. This family includes both long and short form C-type 297666 Lectin C-type domain. This family includes both long and short form C-type 297667 Lectin C-type domain. This family includes both long and short form C-type 297668 Lectin C-type domain. This family includes both long and short form C-type 297669 Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the C 297670 Protein kinase domain 297676 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 297677 S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 297677 S-adenosylmethionine synthetase, N-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 297679 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 297682 Protein-tyrosine phosphatase 297682 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 297683 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 297685 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 297690 Core histone H2A/H2B/H3/H4 297697 Core histone H2A/H2B/H3/H4 297699 WD domain, G-beta repeat 297700 Deoxyribose-phosphate aldolase. This family includes the enzyme deoxyribose-phosphate aldolase EC:4.1.2.4, which is involved in nucleotide metabolism. The family also includes a group of related bacterial proteins of unknown function 297701 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 297702 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 297703 C2 domain 297703 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 297703 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 297704 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 297706 Ribosomal protein L44 297706 Elongation factor 1 gamma, conserved domain 297707 ENV polyprotein (coat polyprotein) 297707 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 297708 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 297712 Ribosomal protein L44 297713 Phosphorylase family 2 297714 Ribosomal L28e protein family 297715 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 297719 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 297723 Ribosomal protein L31e 297731 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 297739 Ribosomal L10 297743 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 297752 Ribosomal family S4e 297755 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 297759 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 297762 ENV polyprotein (coat polyprotein) 297765 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 297767 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 297768 Metallo-beta-lactamase superfamily 297769 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 297769 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 297770 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297770 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297771 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 297771 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 297772 'Cold-shock' DNA-binding domain 297777 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297777 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297785 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297786 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297790 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 297791 Kinesin motor domain 297793 Aminotransferase class I and II 297795 7 transmembrane receptor (rhodopsin family) 297796 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 297797 V-ATPase subunit H 297798 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 297799 Ribosomal protein L15 amino terminal region. This family is always associated with pfam00256. This family is diagnostic of ribosomal L15 proteins 297802 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 297803 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 297805 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 297806 Protein-tyrosine phosphatase 297806 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 297807 Cyclophilin type peptidyl-prolyl cis-trans isomerase 297809 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297809 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297810 Clathrin adaptor complex small chain 297813 Translation initiation factor SUI1 297814 Eukaryotic-type carbonic anhydrase 297816 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 297819 ribosomal L5P family C-terminus. This region is found associated with pfam00281 297820 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 297821 Squalene/phytoene synthase 297826 Enolase, N-terminal domain 297826 Enolase, C-terminal TIM barrel domain 297827 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 297832 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 297835 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 297836 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 297837 AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4- 297838 MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also i 297840 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 297844 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 297845 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 297848 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 297849 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 297851 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 297852 Hsp90 protein 297852 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 297853 HMG (high mobility group) box 297855 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 297855 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 297860 Eukaryotic porin 297863 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 297864 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 297864 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 297869 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 297880 Sushi domain (SCR repeat) 297881 Ribosomal protein S12 297885 Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues 297886 Ribosomal L29e protein family 297888 Ribosomal protein L35Ae 297889 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 297891 Protein kinase domain 297892 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 297893 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 297894 von Willebrand factor type A domain 297897 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 297898 Ribosomal L10 297899 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 297899 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 297902 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 297902 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 297902 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 297907 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 297908 Ribosomal L29e protein family 297909 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 297909 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 297910 Sulfate transporter family. Mutations may lead to several human diseases 297911 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 297915 ENV polyprotein (coat polyprotein) 297916 Protein kinase domain 297918 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 297918 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 297919 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 297920 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 297920 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 297922 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 297927 HMG (high mobility group) box 297930 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 297931 Ribosomal protein L13e 297932 ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae 297934 ICE-like protease (caspase) p10 domain 297934 ICE-like protease (caspase) p20 domain 297937 Hsp90 protein 297938 Hsp90 protein 297939 Ribosomal L29e protein family 297945 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297945 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297946 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297946 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297947 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 297948 Ribosomal protein L6 297949 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 297951 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 297952 7 transmembrane receptor (rhodopsin family) 297954 LIM domain. This family represents two copies of the LIM structural domain 297956 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297958 Uncharacterised protein family (UPF0131). This family of proteins are uncharacterised, however BtrG is part of a butirosin-biosynthetic gene cluster from Bacillus circulans 297966 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 297969 tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase 297970 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 297974 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 297974 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc 297976 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 297980 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 297981 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 297981 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 297982 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 297982 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 297986 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 297986 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 297987 Ribosomal protein S6e 297988 Trypsin 297992 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 297994 BAG domain. Domain present in Hsp70 regulators 297994 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 297996 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 297998 NUDIX domain 298003 Galactose-1-phosphate uridyl transferase, C-terminal domain. SCOP reports fold duplication with N-terminal domain. Both involved in Zn and Fe binding 298005 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 298006 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 298008 Eukaryotic ribosomal protein L18 298009 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298009 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298010 Choline/Carnitine o-acyltransferase 298011 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298011 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298012 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 298013 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 298014 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 298015 Ribosomal protein S2 298016 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298016 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298018 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 298019 Vinculin family 298021 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 298025 Sushi domain (SCR repeat) 298026 von Willebrand factor type A domain 298031 Sushi domain (SCR repeat) 298033 PX domain. PX domains bind to phosphoinositides 298036 Ribosomal protein L23 298038 7 transmembrane receptor (rhodopsin family) 298039 7 transmembrane receptor (rhodopsin family) 298040 7 transmembrane receptor (rhodopsin family) 298042 7 transmembrane receptor (rhodopsin family) 298043 7 transmembrane receptor (rhodopsin family) 298044 7 transmembrane receptor (rhodopsin family) 298045 7 transmembrane receptor (rhodopsin family) 298046 7 transmembrane receptor (rhodopsin family) 298047 7 transmembrane receptor (rhodopsin family) 298048 7 transmembrane receptor (rhodopsin family) 298049 7 transmembrane receptor (rhodopsin family) 298054 Iron/manganese superoxide dismutases, alpha-hairpin domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one 298054 Iron/manganese superoxide dismutases, C-terminal domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one. I 298057 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 298060 Fructose-bisphosphate aldolase class-I 298062 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 298063 7 transmembrane receptor (rhodopsin family) 298067 Proteasome A-type and B-type 298068 Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG 298070 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 298071 Antifreeze-like domain. This family contains type III antifreeze proteins as well as a variety of enzymes. This domain is presumed to be involved in sugar binding in the enzyme proteins 298071 NeuB family. NeuB is the prokaryotic N-acetylneuraminic acid (Neu5Ac) synthase. It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only 298073 Zinc-binding dehydrogenase 298073 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 298074 XPA protein 298079 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 298080 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 298081 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 298082 Ribosomal protein L24e 298084 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 298085 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 298087 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298088 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298090 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 298091 Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hig 298095 WD domain, G-beta repeat 298097 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 298098 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 298099 Herpesvirus glycoprotein D. Herpesviruses are dsDNA viruses with no RNA stage. This is a family consists of glycoprotein-D (gD or gIV) which is common to herpes simplex virus types 1 and 2, as well as equine herpes, bovine herpes and Marek's disease virus 298101 Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc 298103 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 298105 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 298107 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 298108 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298108 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 298109 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 298110 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298111 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 298111 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298112 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 298113 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298114 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 298115 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298115 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 298116 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 298117 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 298117 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 298118 Enolase, N-terminal domain 298118 Enolase, C-terminal TIM barrel domain 298123 Core histone H2A/H2B/H3/H4 298125 Ribosomal protein L13e 298126 Ribosomal protein L31e 298128 ENV polyprotein (coat polyprotein) 298135 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298136 Ribosomal protein L34e 298137 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 298138 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 298145 Eukaryotic initiation factor 4E 298148 Elongation factor 1 gamma, conserved domain 298148 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 298148 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 298152 Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-phos 298153 Ribosomal protein S2 298154 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 298155 GTP1/OBG family 298156 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 298158 WD domain, G-beta repeat 298159 WD domain, G-beta repeat 298161 HMG (high mobility group) box 298162 ENV polyprotein (coat polyprotein) 298163 ENV polyprotein (coat polyprotein) 298169 Actin 298171 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 298172 ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other 298173 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298176 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 298177 Lyase 298180 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 298180 PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi 298180 TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s 298184 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 298187 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 298193 Cyclophilin type peptidyl-prolyl cis-trans isomerase 298196 Thrombospondin type 1 domain 298197 Thrombospondin type 1 domain 298197 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 298199 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 298200 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298200 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298203 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 298204 Ribosomal protein L21e 298205 Interferon alpha/beta domain 298206 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 298207 Interferon alpha/beta domain 298208 Interferon alpha/beta domain 298209 Interferon alpha/beta domain 298210 Interferon alpha/beta domain 298211 Interferon alpha/beta domain 298212 Interferon alpha/beta domain 298213 Interferon alpha/beta domain 298214 Interferon alpha/beta domain 298215 Interferon alpha/beta domain 298216 Interferon alpha/beta domain 298217 Interferon alpha/beta domain 298218 Ribosomal protein S24e 298219 Interferon alpha/beta domain 298220 Interferon alpha/beta domain 298221 Interferon alpha/beta domain 298222 Interferon alpha/beta domain 298223 Interferon alpha/beta domain 298224 Interferon alpha/beta domain 298225 Interferon alpha/beta domain 298226 Myristoyl-CoA:protein N-myristoyltransferase, N-terminal domain. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold 298227 Phosphorylase family 2 298229 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 298230 Intermediate filament tail domain 298234 Eukaryotic porin 298237 Ribosomal protein S7e 298238 HMG (high mobility group) box 298239 Ribosomal protein S27 298241 Chaperonin 10 Kd subunit 298243 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 298245 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 298246 Ribosomal protein L19e 298247 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 298248 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 298248 Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases 298249 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 298250 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 298252 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 298253 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 298254 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 298258 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298258 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298260 Ribosomal protein L21e 298262 Ribosomal protein L21e 298267 Cache domain 298268 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 298270 Ribosomal protein L6 298278 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 298279 Hsp90 protein 298284 Respiratory-chain NADH dehydrogenase, 49 Kd subunit 298285 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298285 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298288 Low-density lipoprotein receptor domain class A 298288 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl 298289 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 298293 Actin 298296 Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont 298297 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298297 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298302 START domain 298305 Oxidoreductase FAD-binding domain 298305 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 298305 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous 298306 CUB domain 298313 Mitochondrial carrier protein 298314 Mitochondrial carrier protein 298316 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 298316 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 298317 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 298318 Mitochondrial carrier protein 298319 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 298322 Zinc finger, C3HC4 type (RING finger) 298322 ATP-dependent protease La (LON) domain 298326 LIM domain. This family represents two copies of the LIM structural domain 298327 LIM domain. This family represents two copies of the LIM structural domain 298328 LIM domain. This family represents two copies of the LIM structural domain 298331 LIM domain. This family represents two copies of the LIM structural domain 298332 LIM domain. This family represents two copies of the LIM structural domain 298334 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 298336 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 298336 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 298338 Zn-finger in Ran binding protein and others 298339 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 298340 Sodium:neurotransmitter symporter family 298344 Homeobox domain 298345 Homeobox domain 298346 Homeobox domain 298347 Homeobox domain 298348 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 298349 BTG1 family. A novel family of anti-proliferative proteins 298350 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 298351 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 298352 Homeobox domain 298353 Homeobox domain 298358 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298358 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298359 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 298360 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 298361 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 298362 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 298367 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298367 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298369 Oxysterol-binding protein 298370 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 298371 NAC domain 298374 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about 298376 Glutathione peroxidase 298381 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 298385 Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the C 298386 Ribosomal protein S8 298392 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 298392 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 298393 7 transmembrane receptor (rhodopsin family) 298394 7 transmembrane receptor (rhodopsin family) 298395 7 transmembrane receptor (rhodopsin family) 298396 7 transmembrane receptor (rhodopsin family) 298397 7 transmembrane receptor (rhodopsin family) 298403 Tropomyosin 298408 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 298408 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 298410 Adenylate kinase 298410 Thymidylate kinase 298412 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 298414 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 298415 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 298417 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 298418 Core histone H2A/H2B/H3/H4 298419 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 298420 Core histone H2A/H2B/H3/H4 298421 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 298422 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 298423 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 298423 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 298424 Polyprenyl synthetase 298428 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 298429 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 298431 Tetraspanin family 298432 GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis of 298435 GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis of 298436 Tetraspanin family 298437 Ribosomal protein L21e 298439 SpoVR like protein. Bacillus subtilis stage V sporulation protein R is involved in spore cortex formation. Little is known about cortex biosynthesis, except that it depends on several sigma E controlled genes, including spoVR 298439 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 298444 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 298445 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 298448 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 298451 ATP synthase subunit C 298452 Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein sy 298454 7 transmembrane receptor (metabotropic glutamate family) 298455 7 transmembrane receptor (metabotropic glutamate family) 298455 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 298457 7 transmembrane receptor (metabotropic glutamate family) 298457 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 298459 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 298461 Protein kinase domain 298463 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 298465 Ribosomal protein L21e 298468 7 transmembrane receptor (rhodopsin family) 298469 7 transmembrane receptor (rhodopsin family) 298470 7 transmembrane receptor (rhodopsin family) 298471 7 transmembrane receptor (rhodopsin family) 298472 7 transmembrane receptor (rhodopsin family) 298473 7 transmembrane receptor (rhodopsin family) 298474 7 transmembrane receptor (rhodopsin family) 298475 7 transmembrane receptor (rhodopsin family) 298476 7 transmembrane receptor (rhodopsin family) 298477 7 transmembrane receptor (rhodopsin family) 298478 7 transmembrane receptor (rhodopsin family) 298479 7 transmembrane receptor (rhodopsin family) 298480 7 transmembrane receptor (rhodopsin family) 298481 7 transmembrane receptor (rhodopsin family) 298482 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 298483 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 298485 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 298487 PMP-22/EMP/MP20/Claudin family 298488 Cyclophilin type peptidyl-prolyl cis-trans isomerase 298490 DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism 298491 CAP protein 298492 Ribosomal protein S5, N-terminal domain 298492 Ribosomal protein S5, C-terminal domain 298494 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 298495 Ribosomal protein L35Ae 298496 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 298496 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 298501 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 298503 Palmitoyl protein thioesterase 298506 Helix-loop-helix DNA-binding domain 298506 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invol 298508 Cyclophilin type peptidyl-prolyl cis-trans isomerase 298509 Ribosomal protein L21e 298510 Poly-adenylate binding protein, unique domain 298511 Growth-Arrest-Specific Protein 2 Domain 298512 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 298517 Ribosomal protein S15 298519 14-3-3 protein 298519 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 298520 Hsp90 protein 298521 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 298521 Bacterial extracellular solute-binding proteins, family 3 298521 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 298521 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 298528 Protein kinase domain 298528 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 298532 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 298533 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 298535 Ribosomal L29e protein family 298541 Putative undecaprenyl diphosphate synthase. Previously known as uncharacterized protein family UPF0015, a single member of this family has been identified as an undecaprenyl diphosphate synthase 298542 'Cold-shock' DNA-binding domain 298550 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 298558 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298558 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 298559 Transcription factor S-II (TFIIS) 298561 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 298562 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298564 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 298566 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 298566 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 298567 Trypsin 298569 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 298570 Rap/ran-GAP 298570 LGN motif, putative GEF specific for G-alpha GTPase 298573 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 298574 Kinesin motor domain 298575 Protein kinase domain 298577 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 298578 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t 298579 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t 298582 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 298584 F-actin capping protein, beta subunit 298590 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 298595 Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-a 298606 Mitochondrial carrier protein 298607 Arginase family 298611 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 298614 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 298623 Carboxylesterase 298625 MtN3/saliva family. This family includes proteins such as drosophila saliva, MtN3 involved in root nodule development and a protein involved in activation and expression of recombination activation genes (RAGs). Although the molecular function of these pr 298627 Ribosomal protein L36e 298628 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298629 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298635 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298639 Protein kinase domain 298640 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298642 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 298644 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 298648 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 298649 Ribosomal protein L36e 298650 NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold 298653 Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase 298654 Mitochondrial carrier protein 298655 Glucose-6-phosphate dehydrogenase, C-terminal domain 298655 Glucose-6-phosphate dehydrogenase, NAD binding domain 298655 Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase 298657 Eukaryotic-type carbonic anhydrase 298661 Ribosomal protein S7e 298671 Actin 298674 Zinc finger, C3HC4 type (RING finger) 298675 Rer1 family. RER1 family protein are involved in involved in the retrieval of some endoplasmic reticulum membrane proteins from the early golgi compartment. The C terminus of yeast Rer1p interacts with a coatomer complex 298679 Protein kinase domain 298682 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 298683 Fibronectin type III domain 298683 von Willebrand factor type A domain 298684 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 298685 Ribosomal protein L20 298686 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 298689 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 298690 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 298692 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 298693 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 298696 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 298696 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 298698 Fibrinogen beta and gamma chains, C-terminal globular domain 298701 SART-1 family. This family of proteins appear to contain a leucine zipper and may therefore be a family of transcription factors 298702 Marek's disease glycoprotein A 298702 Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, d 298703 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 298704 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do 298706 Protein kinase domain 298707 Ribosomal protein S19 298709 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 298710 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 298713 Core histone H2A/H2B/H3/H4 298715 TRAF-type zinc finger 298717 Protein of unknown function (DUF409). Family of eukaryotic membrane proteins with unknown function 298719 WD domain, G-beta repeat 298724 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 298725 7 transmembrane receptor (rhodopsin family) 298726 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 298727 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298727 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298730 BTG1 family. A novel family of anti-proliferative proteins 298731 Glutamate/Leucine/Phenylalanine/Valine dehydrogenase 298733 7 transmembrane receptor (rhodopsin family) 298734 7 transmembrane receptor (rhodopsin family) 298735 Protein kinase domain 298735 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 298736 Protein kinase domain 298736 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 298739 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 298740 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 298742 Ribosomal protein S19 298744 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 298746 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 298748 Uncharacterized ACR, COG1565 298749 Ribosomal S17 298750 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298750 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298754 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298754 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298758 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 298760 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 298762 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 298764 Synaptophysin / synaptoporin 298766 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 298774 Cyclophilin type peptidyl-prolyl cis-trans isomerase 298777 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 298778 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 298778 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 298779 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 298783 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 298785 Ribosomal protein S26e 298786 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 298787 Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown 298788 HMG (high mobility group) box 298791 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 298792 Yippee putative zinc-binding protein 298793 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298793 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298795 14-3-3 protein 298798 Enolase, C-terminal TIM barrel domain 298801 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 298803 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess 298804 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess 298805 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 298807 Glutaredoxin 298810 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298811 Ribosomal protein L19e 298814 Glutaredoxin 298815 Ribosomal protein L19e 298816 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298817 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298818 Ribosomal protein L19e 298819 Ribosomal protein L19e 298820 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298821 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298822 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298823 Ribosomal protein L19e 298825 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298826 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298827 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298828 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298829 Ribosomal protein L19e 298830 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298832 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298834 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 298835 Ribosomal protein L19e 298839 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 298840 Aspartate/ornithine carbamoyltransferase, Asp/Orn binding domain 298842 WD domain, G-beta repeat 298848 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 298848 EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the 298849 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 298855 Ribosomal protein L36e 298857 7 transmembrane receptor (Secretin family) 298857 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 298858 NAC domain 298859 ADP-ribosylation factor family 298859 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 298859 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 298862 Ribosomal L29e protein family 298863 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 298865 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 298866 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 298867 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 298867 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 298871 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 298873 Amino acid permease 298874 Pumilio-family RNA binding repeat. Puf repeats (aka PUM-HD, Pumilio homology domain) are necessary and sufficient for sequence specific RNA binding in fly Pumilio and worm FBF-1 and FBF-2. Both proteins function as translational repressors in early embryo 298875 Golgi 4-transmembrane spanning transporter 298880 Ribosomal protein S5, C-terminal domain 298882 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 298883 XPG I-region 298883 XPG N-terminal domain 298886 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 298887 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 298893 Ribosomal protein S8 298894 Helix-loop-helix DNA-binding domain 298894 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invol 298895 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 298896 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 298900 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 298901 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 298902 Uracil DNA glycosylase superfamily 298904 ribosomal L5P family C-terminus. This region is found associated with pfam00281 298908 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 298908 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 298912 Fibrinogen beta and gamma chains, C-terminal globular domain 298913 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 298917 Lipase/Acylhydrolase with GDSL-like motif 298919 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 298920 Ribonucleotide reductase, small chain 298921 Ribosomal L10 298921 Ribosomal protein L31e 298922 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298923 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain 298924 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 298931 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 298932 Allantoicase repeat. This family is found in pairs in Allantoicases, forming the majority of the protein. These proteins allow the use of purines as secondary nitrogen sources in nitrogen-limiting conditions through the reaction: allantoate + H(2)0 = (-)- 298933 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 298934 ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be e 298942 Pyridine nucleotide-disulphide oxidoreductase, dimerisation domain. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases 298942 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 298944 7 transmembrane receptor (rhodopsin family) 298946 Phosphatidylinositol 3- and 4-kinase 298947 Phosphatidylinositol 3- and 4-kinase 298947 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 298947 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 298947 PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains 298948 Eukaryotic-type carbonic anhydrase 298949 Synaptophysin / synaptoporin 298950 Helix-loop-helix DNA-binding domain 298951 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 298954 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 298956 PXA domain. This domain is associated with PX domains pfam00787 298957 Hsp90 protein 298958 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298958 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298960 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298962 Tetraspanin family 298965 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 298967 Chaperonin 10 Kd subunit 298968 Ribosomal protein S5, C-terminal domain 298968 Ribosomal protein S5, N-terminal domain 298972 HMG (high mobility group) box 298973 Ribosomal L29e protein family 298974 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 298976 Gelsolin repeat 298981 Homeobox domain 298984 Ribosomal protein L21e 298985 Ribosomal protein S26e 298987 Ribosomal protein L6 298988 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 298989 Enolase, N-terminal domain 298989 Enolase, C-terminal TIM barrel domain 298990 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298990 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298991 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298991 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 298992 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 298992 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 298993 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 298994 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 298994 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 298996 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 298997 ATP synthase subunit C 298999 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 298999 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299001 Ribosomal protein L31e 299002 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 299004 Clathrin adaptor complex small chain 299004 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 299006 Uncharacterized ACR, COG1490 299007 7 transmembrane receptor (rhodopsin family) 299009 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 299021 Ribosomal protein S19e 299022 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 299025 14-3-3 protein 299026 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 299028 Mitochondrial carrier protein 299029 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299029 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299031 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 299031 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 299032 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 299035 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 299036 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 299037 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 299038 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 299039 Mitochondrial carrier protein 299040 Protein kinase domain 299041 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 299052 Fructose-bisphosphate aldolase class-I 299054 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 299055 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299055 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299056 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 299056 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 299058 Homeobox domain 299059 Proline dehydrogenase 299060 Insulinase (Peptidase family M16) 299061 Homeobox domain 299062 Ribosomal L15 299063 Ribosomal L15 299064 Homeobox domain 299068 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 299068 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 299069 PXA domain. This domain is associated with PX domains pfam00787 299070 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 299071 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 299073 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 299075 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 299076 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 299079 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 299080 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299080 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299081 Hsp90 protein 299082 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 299083 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 299084 Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 299087 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 299090 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1 299091 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299091 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299092 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 299094 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299094 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299094 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 299099 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 299100 Cyclophilin type peptidyl-prolyl cis-trans isomerase 299102 Ribosomal protein S6e 299103 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 299103 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 299104 FKBP-type peptidyl-prolyl cis-trans isomerase 299105 Eukaryotic porin 299106 Ribosomal L10 299106 Ribosomal protein L16 299107 Cyclophilin type peptidyl-prolyl cis-trans isomerase 299108 Ribosomal protein L5 299108 ribosomal L5P family C-terminus. This region is found associated with pfam00281 299110 Delta-aminolevulinic acid dehydratase 299111 Ribosomal protein S8e 299112 DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme cons 299116 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 299119 Domain of unknown function (DUF341) 299119 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 299125 Ribosomal protein S2 299126 Ribosomal L15 299128 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 299128 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 299131 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 299134 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 299135 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 299136 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 299138 Homeobox domain 299142 C2 domain 299144 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 299145 ADP-ribosylation factor family 299145 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 299146 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 299147 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 299148 Ribosomal S17 299149 Ribosomal protein L31e 299150 Ribosomal S17 299156 ADP-ribosylation factor family 299156 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 299156 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 299157 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 299158 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 299159 ATP synthase subunit D. This is a family of subunit D form various ATP synthases including V-type H+ transporting and Na+ dependent. Subunit D is suggested to be an integral part of the catalytic sector of the V-ATPase 299163 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 299166 Ribosomal L39 protein 299167 Macrophage migration inhibitory factor (MIF) 299168 Core histone H2A/H2B/H3/H4 299171 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 299172 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 299172 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 299174 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 299175 Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene 299176 S-adenosyl-L-homocysteine hydrolase 299177 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299178 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t 299182 Ribosomal protein L21e 299182 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 299184 Poly-adenylate binding protein, unique domain 299185 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 299188 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 299189 Phosphotyrosine interaction domain (PTB/PID) 299190 Ribosomal protein L23 299194 Zinc-binding dehydrogenase 299195 Monooxygenase. This family includes diverse enzymes that utilise FAD 299196 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 299197 Filamin/ABP280 repeat 299197 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 299201 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 299201 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 299202 MIT domain 299202 Protein kinase domain 299203 Acylphosphatase 299205 Ribosomal protein L21e 299206 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 299207 Uncharacterised protein family (UPF0143). This family of uncharacterised proteins are integral membrane proteins. They may contain 4 transmembrane helices. The family contains a conserved arginine and histidine that may be functionally important 299210 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 299210 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 299211 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 299212 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 299213 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299223 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 299224 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 299227 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299228 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 299232 metallopeptidase family M24 299236 Fibronectin type III domain 299236 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 299239 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 299241 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 299242 7 transmembrane receptor (rhodopsin family) 299244 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 299251 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 299254 Ribosomal protein S2 299255 Kinesin motor domain 299266 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 299267 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 299270 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299271 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299271 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299272 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299273 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299274 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299276 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299277 ENV polyprotein (coat polyprotein) 299277 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299277 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 299278 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299279 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299280 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299281 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299282 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299282 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299283 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 299284 RNase3 domain 299285 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 299286 Ribosomal protein L6e 299286 Ribosomal protein L6, N-terminal domain 299289 TCL1/MTCP1 family. Two related oncogenes, TCL-1 and MTCP-1, are overexpressed in T cell prolymphocytic leukemias as a result of chromosomal rearrangements that involve the translocation of one T cell receptor gene to either chromosome 14q32 or Xq28 299294 Ribosomal protein L21e 299298 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 299301 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 299301 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 299302 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 299303 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 299304 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 299306 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 299307 Core histone H2A/H2B/H3/H4 299308 Ribosomal protein S5, C-terminal domain 299308 Ribosomal protein S5, N-terminal domain 299309 Fork head domain 299313 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b 299316 Mitochondrial carrier protein 299325 Ribosomal protein L3 299326 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 299327 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299327 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299328 Ribosomal protein S21e 299329 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 299331 Hsp90 protein 299331 Hsp90 protein 299331 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 299332 Protein kinase domain 299333 Ribosomal protein L44 299343 Fibrillarin 299343 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 299343 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 299346 NUDIX domain 299355 Marek's disease glycoprotein A 299358 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299360 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299362 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299366 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299366 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 299367 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299368 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299369 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299372 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299375 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299376 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299382 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299384 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299386 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299388 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299389 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299390 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299391 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299393 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299397 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299398 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299400 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299402 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299404 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299405 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299406 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299409 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299411 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299413 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299415 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 299418 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299420 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299423 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299428 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299434 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299435 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299437 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299438 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299440 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 299441 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299443 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299450 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299451 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299455 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299458 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299460 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299462 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299464 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299467 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299468 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299469 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299471 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299473 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299475 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299478 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299479 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299483 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299484 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299488 C2 domain 299489 C2 domain 299491 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 299491 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 299492 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 299496 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 299497 Macrophage migration inhibitory factor (MIF) 299498 Protein kinase domain 299500 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299500 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299501 Yippee putative zinc-binding protein 299502 GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein 299503 Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer 299504 Glutamine amidotransferase class-I 299505 Aminotransferase class I and II 299510 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 299511 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 299512 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 299513 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 299513 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 299514 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 299519 Translationally controlled tumor protein 299520 Ribosomal protein S5, C-terminal domain 299520 Ribosomal protein S5, N-terminal domain 299523 Hsp90 protein 299524 Hsp90 protein 299526 Heme oxygenase 299530 7 transmembrane receptor (rhodopsin family) 299535 Translationally controlled tumor protein 299535 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 299538 Ribosomal protein S7e 299544 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299546 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299551 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299552 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 299553 7 transmembrane receptor (rhodopsin family) 299555 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 299556 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 299557 jmjC domain 299557 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 299557 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 299558 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299558 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299560 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 299562 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 299564 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 299566 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 299570 7 transmembrane receptor (rhodopsin family) 299572 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 299573 7 transmembrane receptor (rhodopsin family) 299574 7 transmembrane receptor (rhodopsin family) 299575 7 transmembrane receptor (rhodopsin family) 299576 7 transmembrane receptor (rhodopsin family) 299578 7 transmembrane receptor (metabotropic glutamate family) 299578 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 299578 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 299579 7 transmembrane receptor (metabotropic glutamate family) 299579 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 299581 7 transmembrane receptor (metabotropic glutamate family) 299582 7 transmembrane receptor (rhodopsin family) 299583 7 transmembrane receptor (metabotropic glutamate family) 299586 7 transmembrane receptor (rhodopsin family) 299587 ICE-like protease (caspase) p10 domain 299587 ICE-like protease (caspase) p20 domain 299588 7 transmembrane receptor (rhodopsin family) 299589 7 transmembrane receptor (rhodopsin family) 299590 7 transmembrane receptor (rhodopsin family) 299591 7 transmembrane receptor (rhodopsin family) 299592 7 transmembrane receptor (rhodopsin family) 299593 7 transmembrane receptor (rhodopsin family) 299594 7 transmembrane receptor (rhodopsin family) 299595 7 transmembrane receptor (rhodopsin family) 299596 7 transmembrane receptor (rhodopsin family) 299597 7 transmembrane receptor (rhodopsin family) 299598 7 transmembrane receptor (metabotropic glutamate family) 299602 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 299603 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 299604 DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases 299606 Trypsin 299609 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 299610 RNA polymerase Rpb5, C-terminal domain. The assembly domain of Rpb5. The archaeal equivalent to this domain is subunit H. Subunit H lacks the N-terminal domain 299610 RNA polymerase Rpb5, N-terminal domain. Rpb5 has a bipartite structure which includes a eukaryote-specific N-terminal domain and a C-terminal domain resembling the archaeal RNAP subunit H. The N-terminal domain is involved in DNA binding and is part of th 299614 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 299615 Homeobox domain 299617 Dienelactone hydrolase family 299617 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 299622 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299622 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299623 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 299624 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299625 Intermediate filament protein 299626 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 299627 pfam02891, zf-MIZ, MIZ zinc finger 299632 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 299636 Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation 299638 Sir2 family 299640 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 299644 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 299649 7 transmembrane receptor (rhodopsin family) 299649 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 299650 7 transmembrane receptor (rhodopsin family) 299650 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 299651 Uncharacterized protein family UPF0023 299652 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 299653 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 299653 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 299654 7 transmembrane receptor (metabotropic glutamate family) 299655 7 transmembrane receptor (metabotropic glutamate family) 299655 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 299657 7 transmembrane receptor (metabotropic glutamate family) 299659 7 transmembrane receptor (metabotropic glutamate family) 299660 Protein kinase domain 299662 Protein kinase domain 299663 Protein kinase domain 299664 Protein kinase domain 299667 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 299668 Protein kinase domain 299670 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 299671 Protein kinase domain 299672 Protein kinase domain 299673 Protein kinase domain 299674 Protein kinase domain 299676 Protein kinase domain 299680 7 transmembrane receptor (metabotropic glutamate family) 299682 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 299684 7 transmembrane receptor (rhodopsin family) 299685 LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 299685 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 299686 Uncharacterized protein family UPF0027 299687 Helix-loop-helix DNA-binding domain 299689 'Cold-shock' DNA-binding domain 299692 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase comp 299694 Protein kinase domain 299698 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 299699 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 299703 HMG (high mobility group) box 299705 14-3-3 protein 299706 WD domain, G-beta repeat 299710 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 299711 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299711 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299712 Ribosomal protein S5, C-terminal domain 299712 Ribosomal protein S5, N-terminal domain 299713 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 299718 Cyclophilin type peptidyl-prolyl cis-trans isomerase 299724 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 299726 Ribosomal protein L21e 299727 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299727 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299732 Proteasome A-type and B-type 299733 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 299737 Laminin N-terminal (Domain VI) 299737 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 299737 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 299740 Ribosomal protein L31e 299744 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299744 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299745 Transaldolase 299746 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299746 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299749 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 299749 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 299749 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 299750 Citrate synthase 299752 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299754 Ribosomal protein S15 299756 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299758 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299761 ENV polyprotein (coat polyprotein) 299762 TPR Domain 299763 Intermediate filament protein 299766 Helix-loop-helix DNA-binding domain 299766 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates s 299767 Helix-loop-helix DNA-binding domain 299767 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates s 299768 Ribosomal protein S6e 299769 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 299772 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 299775 Ribosomal protein L44 299776 Mitosis protein DIM1 299777 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 299779 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 299780 Oxysterol-binding protein 299783 SCP-like extracellular protein. This domain is also found in prokaryotes 299784 SCP-like extracellular protein. This domain is also found in prokaryotes 299786 ENV polyprotein (coat polyprotein) 299788 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 299790 ADP-ribosylation factor family 299791 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 299792 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 299796 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 299797 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 299799 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 299800 Integral membrane protein DUF92. Members of this family have several predicted transmembrane helices. The function of these prokaryotic proteins is unknown 299801 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 299802 7 transmembrane receptor (rhodopsin family) 299804 Ribosomal protein L19e 299805 NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5 299806 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 299808 Uncharacterized protein family UPF0027 299809 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 299810 YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity 299813 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 299815 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299815 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299820 Protein-tyrosine phosphatase 299820 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 299822 Ribosomal L29e protein family 299823 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 299824 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 299825 Sulfatase 299827 Protein kinase domain 299832 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 299833 Ribosomal protein L19e 299835 Actin 299840 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 299842 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 299843 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 299844 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 299845 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 299846 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 299847 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 299848 Ribosomal protein S26e 299851 WHEP-TRS domain 299852 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299853 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299853 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299854 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 299857 Serine hydroxymethyltransferase 299868 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 299868 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 299870 ENV polyprotein (coat polyprotein) 299870 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 299872 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 299875 Intermediate filament protein 299878 Cadherin domain 299879 Fibrinogen beta and gamma chains, C-terminal globular domain 299881 Fibrinogen beta and gamma chains, C-terminal globular domain 299886 Ribosomal L38e protein family 299888 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299888 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299890 CUB domain 299891 CUB domain 299891 Sushi domain (SCR repeat) 299892 CUB domain 299893 Sushi domain (SCR repeat) 299894 Sushi domain (SCR repeat) 299899 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 299901 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 299903 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 299904 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 299906 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 299913 Proteasome A-type and B-type 299914 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 299915 Protein kinase domain 299917 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 299919 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t 299921 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 299923 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 299923 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to 299927 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299928 Lectin C-type domain. This family includes both long and short form C-type 299928 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 299929 Lectin C-type domain. This family includes both long and short form C-type 299929 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 299930 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299930 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299931 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 299934 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 299935 Ribosomal protein L31e 299936 Translation initiation factor SUI1 299937 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 299938 Ribosomal protein L13 299942 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 299943 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 299944 Sugar (and other) transporter 299945 Mitochondrial carrier protein 299947 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 299948 Ribosomal protein L31e 299951 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 299952 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 299958 Isocitrate/isopropylmalate dehydrogenase 299961 GCM motif protein 299962 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 299964 NAD-dependent glycerol-3-phosphate dehydrogenase 299965 ENV polyprotein (coat polyprotein) 299968 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 299968 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 299969 WD domain, G-beta repeat 299969 Sof1-like domain. Sof1 is essential for cell growth and is a component of the nucleolar rRNA processing machinery 299970 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 299971 V-ATPase subunit C 299972 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 299976 Ribonucleotide reductase, small chain 299980 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 299982 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 299983 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 299983 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 299985 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 299988 ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisin 299990 Protein kinase domain 299991 Protein kinase domain 299995 ENV polyprotein (coat polyprotein) 299996 von Willebrand factor type A domain 300001 Ribosomal protein L36e 300005 Actin 300007 Core histone H2A/H2B/H3/H4 300009 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 300011 Ribosomal protein S19e 300014 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300015 4-hydroxybenzoyl-CoA thioesterase. This enzyme (EC 3.1.2.23) catalyses the final step in the biosynthesis of 4-hydroxybenzoate from 4-chlorobenzoate in the soil dwelling microbe Pseudomonas CBS-3 300016 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300017 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300018 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300020 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300021 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300023 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300024 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300025 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 300026 Ribosomal protein L21e 300029 Eukaryotic DNA topoisomerase I, catalytic core. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and recombina 300030 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 300031 Ribosomal protein L21e 300033 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 300034 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di 300034 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 300035 Delta 1-pyrroline-5-carboxylate reductase 300036 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 300036 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 300044 Adenylate and Guanylate cyclase catalytic domain 300045 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 300046 Gaa1-like, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to newly s 300047 Cytochrome C1 family 300050 WD domain, G-beta repeat 300051 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 300052 VPS28 protein 300053 Kinesin motor domain 300054 Fork head domain 300055 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 300062 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 300063 Octicosapeptide repeat. Short motif that may bind Ca2+ 300066 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 300068 Aminotransferase class I and II 300071 Domain of unknown function 300077 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 300078 Dynein light chain type 1 300079 Ribosomal protein L3 300082 Ribosomal protein L21e 300083 Mitochondrial carrier protein 300088 RNA polymerase Rpb7, N-terminal domain. Rpb7 bind to Rpb4 to form a heterodimer. This complex is thought to interact with the nascent RNA strand during Pol II elongation 300089 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 300090 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 300092 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 300093 Cytochrome C oxidase subunit II, periplasmic domain 300095 Helix-loop-helix DNA-binding domain 300099 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 300101 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300103 Ribosomal protein L36e 300104 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 300105 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 300108 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of 300111 Bacterial surface antigen. This entry includes the following surface antigens 300112 Bacterial surface antigen. This entry includes the following surface antigens 300115 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 300117 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 300121 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 300123 Cyclophilin type peptidyl-prolyl cis-trans isomerase 300127 7 transmembrane receptor (Secretin family) 300127 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 300127 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 300128 Cadherin domain 300144 Protein kinase domain 300147 PH domain. PH stands for pleckstrin homology 300150 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 300156 Ribosomal protein S19e 300156 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 300158 WD domain, G-beta repeat 300161 Protein kinase domain 300162 HMG (high mobility group) box 300165 Respiratory-chain NADH dehydrogenase, 49 Kd subunit 300170 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 300172 LIM domain. This family represents two copies of the LIM structural domain 300173 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 300174 Ribosomal protein L35Ae 300175 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 300177 Protein kinase domain 300178 Homeobox domain 300179 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 300181 Ribosomal protein S5, N-terminal domain 300182 Ribosomal protein S5, N-terminal domain 300185 Ribosomal protein L31e 300187 NAC domain 300188 Translationally controlled tumor protein 300194 7 transmembrane receptor (rhodopsin family) 300195 7 transmembrane receptor (rhodopsin family) 300197 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 300197 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 300198 7 transmembrane receptor (rhodopsin family) 300199 7 transmembrane receptor (rhodopsin family) 300200 7 transmembrane receptor (rhodopsin family) 300202 7 transmembrane receptor (rhodopsin family) 300203 7 transmembrane receptor (rhodopsin family) 300204 7 transmembrane receptor (rhodopsin family) 300205 7 transmembrane receptor (rhodopsin family) 300208 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 300211 FKBP-type peptidyl-prolyl cis-trans isomerase 300212 wnt family 300216 Ribosomal protein L13e 300217 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 300217 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 300220 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 300222 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 300224 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 300227 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 300231 AMP-binding enzyme 300232 Trypsin 300232 Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are 300238 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 300239 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 300241 S25 ribosomal protein 300242 Intermediate filament protein 300243 Intermediate filament protein 300244 Intermediate filament protein 300245 Intermediate filament protein 300245 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 300246 Intermediate filament protein 300246 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 300247 Intermediate filament protein 300247 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 300247 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 300248 Intermediate filament protein 300248 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 300249 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 300250 Intermediate filament protein 300251 Intermediate filament protein 300251 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 300255 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 300257 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b 300258 Uncharacterised protein family (UPF0160). This family of proteins contains a large number of metal binding residues. The patterns are suggestive of a phosphoesterase function. The conserved DHH motif may mean this family is related to pfam01368 300260 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b 300260 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 300262 Homeobox domain 300263 Homeobox domain 300264 Homeobox domain 300264 Occludin/ELL family 300266 'chromo' (CHRromatin Organization MOdifier) domain 300266 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 300268 Zinc finger, C3HC4 type (RING finger) 300271 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 300274 Zinc finger, C3HC4 type (RING finger) 300274 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 300275 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 300276 C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome 300277 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300278 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 300278 S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea 300283 Ribosomal protein L10 300284 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 300291 Actin 300295 Hsp90 protein 300296 Cyclophilin type peptidyl-prolyl cis-trans isomerase 300300 Stathmin family 300300 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 300302 HIT family 300303 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 300305 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 300306 Cyclophilin type peptidyl-prolyl cis-trans isomerase 300306 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 300307 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 300308 Protein kinase domain 300313 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 300315 Ribosomal protein L19e 300320 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 300321 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 300323 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 300328 4'-phosphopantetheinyl transferase superfamily. Members of this family transfers the 4'-phosphopantetheine (4'-PP) moiety from coenzyme A (CoA) to the invariant serine of pfam00550. This post-translational modification renders holo-ACP capable of acyl gro 300329 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 300330 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 300331 Phosphoglycerate kinase 300332 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 300334 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 300335 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 300337 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 300338 Fragilysin metallopeptidase (M10C) enterotoxin 300338 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 300338 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 300338 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 300339 Fragilysin metallopeptidase (M10C) enterotoxin 300339 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 300339 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 300339 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family 300339 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 300340 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 300340 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 300340 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 300341 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 300341 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 300341 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 300344 Galactoside-binding lectin 300345 ENV polyprotein (coat polyprotein) 300346 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 300347 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 300347 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 300347 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 300350 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 300350 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 300352 ENV polyprotein (coat polyprotein) 300353 Protein kinase domain 300353 Protein kinase C terminal domain 300354 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 300355 ENV polyprotein (coat polyprotein) 300357 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 300358 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300358 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300363 Peptidase family M49 300364 Peptidase family M49 300365 Ribosomal protein S24e 300377 7 transmembrane receptor (rhodopsin family) 300378 7 transmembrane receptor (rhodopsin family) 300379 7 transmembrane receptor (rhodopsin family) 300380 7 transmembrane receptor (rhodopsin family) 300381 HMG (high mobility group) box 300382 EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the 300383 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 300384 Peptidase family M28D 300384 Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure 300385 7 transmembrane receptor (rhodopsin family) 300386 HMG (high mobility group) box 300388 Ribosomal L15 300391 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 300392 7 transmembrane receptor (rhodopsin family) 300393 7 transmembrane receptor (rhodopsin family) 300394 7 transmembrane receptor (rhodopsin family) 300394 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 300395 7 transmembrane receptor (rhodopsin family) 300396 7 transmembrane receptor (rhodopsin family) 300397 7 transmembrane receptor (rhodopsin family) 300398 7 transmembrane receptor (rhodopsin family) 300399 7 transmembrane receptor (rhodopsin family) 300400 7 transmembrane receptor (rhodopsin family) 300401 7 transmembrane receptor (rhodopsin family) 300402 7 transmembrane receptor (rhodopsin family) 300403 7 transmembrane receptor (rhodopsin family) 300404 7 transmembrane receptor (rhodopsin family) 300405 7 transmembrane receptor (rhodopsin family) 300406 7 transmembrane receptor (rhodopsin family) 300407 7 transmembrane receptor (rhodopsin family) 300408 7 transmembrane receptor (rhodopsin family) 300409 7 transmembrane receptor (rhodopsin family) 300410 7 transmembrane receptor (rhodopsin family) 300411 7 transmembrane receptor (rhodopsin family) 300413 7 transmembrane receptor (rhodopsin family) 300414 7 transmembrane receptor (rhodopsin family) 300415 7 transmembrane receptor (rhodopsin family) 300416 7 transmembrane receptor (rhodopsin family) 300417 7 transmembrane receptor (rhodopsin family) 300417 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 300418 7 transmembrane receptor (rhodopsin family) 300419 7 transmembrane receptor (rhodopsin family) 300420 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 300421 7 transmembrane receptor (rhodopsin family) 300422 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 300423 7 transmembrane receptor (rhodopsin family) 300425 7 transmembrane receptor (rhodopsin family) 300426 7 transmembrane receptor (rhodopsin family) 300426 'chromo' (CHRromatin Organization MOdifier) domain 300427 7 transmembrane receptor (rhodopsin family) 300428 7 transmembrane receptor (rhodopsin family) 300430 Fork head domain 300431 7 transmembrane receptor (rhodopsin family) 300432 ENV polyprotein (coat polyprotein) 300433 7 transmembrane receptor (rhodopsin family) 300434 7 transmembrane receptor (rhodopsin family) 300435 7 transmembrane receptor (rhodopsin family) 300436 7 transmembrane receptor (rhodopsin family) 300438 Low-density lipoprotein receptor domain class A 300438 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 300438 Low-density lipoprotein receptor domain class A 300438 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 300444 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 300444 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 300446 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 300448 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 300453 S25 ribosomal protein 300455 von Willebrand factor type D domain 300455 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 300458 7 transmembrane receptor (rhodopsin family) 300465 Triosephosphate isomerase 300467 Glycosyl hydrolases family 35 300472 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps 300473 Eukaryotic porin 300474 Thrombospondin type 1 domain 300474 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 300474 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 300475 Thrombospondin type 1 domain 300475 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 300476 Ribosomal L29e protein family 300478 Ribosomal protein L21e 300479 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 300479 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 300480 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 300481 Ribosomal protein S19e 300482 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 300482 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 300484 Cyclophilin type peptidyl-prolyl cis-trans isomerase 300485 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 300489 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 300490 Fibronectin type III domain 300491 Enolase, N-terminal domain 300491 Enolase, C-terminal TIM barrel domain 300491 Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain 300511 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 300515 Ribosomal protein L34e 300518 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 300518 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 300522 7 transmembrane receptor (rhodopsin family) 300523 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 300524 7 transmembrane receptor (rhodopsin family) 300525 7 transmembrane receptor (rhodopsin family) 300526 7 transmembrane receptor (rhodopsin family) 300527 7 transmembrane receptor (rhodopsin family) 300528 7 transmembrane receptor (rhodopsin family) 300529 7 transmembrane receptor (rhodopsin family) 300530 7 transmembrane receptor (rhodopsin family) 300531 7 transmembrane receptor (rhodopsin family) 300533 Hsp90 protein 300533 7 transmembrane receptor (rhodopsin family) 300533 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 300534 7 transmembrane receptor (rhodopsin family) 300535 7 transmembrane receptor (rhodopsin family) 300536 7 transmembrane receptor (rhodopsin family) 300537 7 transmembrane receptor (rhodopsin family) 300538 7 transmembrane receptor (rhodopsin family) 300539 7 transmembrane receptor (rhodopsin family) 300540 7 transmembrane receptor (rhodopsin family) 300543 Myosin head (motor domain) 300545 7 transmembrane receptor (rhodopsin family) 300546 7 transmembrane receptor (rhodopsin family) 300547 7 transmembrane receptor (rhodopsin family) 300548 7 transmembrane receptor (rhodopsin family) 300549 7 transmembrane receptor (rhodopsin family) 300550 7 transmembrane receptor (rhodopsin family) 300551 7 transmembrane receptor (rhodopsin family) 300552 7 transmembrane receptor (rhodopsin family) 300554 7 transmembrane receptor (rhodopsin family) 300556 7 transmembrane receptor (rhodopsin family) 300557 7 transmembrane receptor (rhodopsin family) 300558 7 transmembrane receptor (rhodopsin family) 300559 7 transmembrane receptor (rhodopsin family) 300560 7 transmembrane receptor (rhodopsin family) 300561 7 transmembrane receptor (rhodopsin family) 300562 7 transmembrane receptor (rhodopsin family) 300563 7 transmembrane receptor (rhodopsin family) 300564 7 transmembrane receptor (rhodopsin family) 300565 7 transmembrane receptor (rhodopsin family) 300566 7 transmembrane receptor (rhodopsin family) 300567 7 transmembrane receptor (rhodopsin family) 300568 7 transmembrane receptor (rhodopsin family) 300569 7 transmembrane receptor (rhodopsin family) 300570 7 transmembrane receptor (rhodopsin family) 300571 7 transmembrane receptor (rhodopsin family) 300572 7 transmembrane receptor (rhodopsin family) 300573 7 transmembrane receptor (rhodopsin family) 300574 7 transmembrane receptor (rhodopsin family) 300575 7 transmembrane receptor (rhodopsin family) 300576 7 transmembrane receptor (rhodopsin family) 300577 7 transmembrane receptor (rhodopsin family) 300578 7 transmembrane receptor (rhodopsin family) 300579 7 transmembrane receptor (rhodopsin family) 300580 7 transmembrane receptor (rhodopsin family) 300581 7 transmembrane receptor (rhodopsin family) 300582 7 transmembrane receptor (rhodopsin family) 300583 7 transmembrane receptor (rhodopsin family) 300584 7 transmembrane receptor (rhodopsin family) 300585 7 transmembrane receptor (rhodopsin family) 300587 7 transmembrane receptor (rhodopsin family) 300588 7 transmembrane receptor (rhodopsin family) 300589 7 transmembrane receptor (rhodopsin family) 300590 7 transmembrane receptor (rhodopsin family) 300591 7 transmembrane receptor (rhodopsin family) 300592 7 transmembrane receptor (rhodopsin family) 300593 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 300595 7 transmembrane receptor (rhodopsin family) 300596 7 transmembrane receptor (rhodopsin family) 300597 7 transmembrane receptor (rhodopsin family) 300598 7 transmembrane receptor (rhodopsin family) 300599 7 transmembrane receptor (rhodopsin family) 300600 7 transmembrane receptor (rhodopsin family) 300602 7 transmembrane receptor (rhodopsin family) 300603 7 transmembrane receptor (rhodopsin family) 300604 7 transmembrane receptor (rhodopsin family) 300605 7 transmembrane receptor (rhodopsin family) 300606 7 transmembrane receptor (rhodopsin family) 300607 7 transmembrane receptor (rhodopsin family) 300609 7 transmembrane receptor (rhodopsin family) 300610 7 transmembrane receptor (rhodopsin family) 300613 7 transmembrane receptor (rhodopsin family) 300614 7 transmembrane receptor (rhodopsin family) 300615 7 transmembrane receptor (rhodopsin family) 300616 7 transmembrane receptor (rhodopsin family) 300617 Ribosomal protein S5, C-terminal domain 300617 Ribosomal protein S5, N-terminal domain 300618 7 transmembrane receptor (rhodopsin family) 300620 7 transmembrane receptor (rhodopsin family) 300621 7 transmembrane receptor (rhodopsin family) 300622 7 transmembrane receptor (rhodopsin family) 300623 7 transmembrane receptor (rhodopsin family) 300624 7 transmembrane receptor (rhodopsin family) 300625 7 transmembrane receptor (rhodopsin family) 300626 7 transmembrane receptor (rhodopsin family) 300627 7 transmembrane receptor (rhodopsin family) 300628 7 transmembrane receptor (rhodopsin family) 300629 7 transmembrane receptor (rhodopsin family) 300630 7 transmembrane receptor (rhodopsin family) 300631 7 transmembrane receptor (rhodopsin family) 300632 7 transmembrane receptor (rhodopsin family) 300633 7 transmembrane receptor (rhodopsin family) 300634 7 transmembrane receptor (rhodopsin family) 300635 7 transmembrane receptor (rhodopsin family) 300636 7 transmembrane receptor (rhodopsin family) 300637 7 transmembrane receptor (rhodopsin family) 300638 7 transmembrane receptor (rhodopsin family) 300639 7 transmembrane receptor (rhodopsin family) 300640 7 transmembrane receptor (rhodopsin family) 300643 7 transmembrane receptor (rhodopsin family) 300645 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in 300646 Ribosomal L15 300647 Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome 300647 EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the 300648 Homeobox domain 300651 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300651 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300653 Zona pellucida-like domain 300653 von Willebrand factor type D domain 300653 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 300654 Ribosomal L29e protein family 300655 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 300656 B-box zinc finger 300657 B-box zinc finger 300660 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 300662 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 300668 Core histone H2A/H2B/H3/H4 300669 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 300679 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 300680 Trypsin 300681 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 300682 Trypsin 300682 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 300685 ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other 300686 ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other 300688 Ribosomal protein S21e 300689 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 300691 NNMT/PNMT/TEMT family 300692 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 300697 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300697 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300698 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 300701 HMG14 and HMG17 300706 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 300707 NAC domain 300709 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300709 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300710 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 300710 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 300714 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 300715 WD domain, G-beta repeat 300716 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300716 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300718 SH2 domain 300721 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 300721 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 300722 WD domain, G-beta repeat 300723 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 300726 Electron transfer flavoprotein alpha subunit. This protein is distantly related to and forms a heterodimer with pfam01012 300728 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 300729 Sodium / potassium ATPase beta chain 300731 Ribosomal protein L21e 300732 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 300733 Tetraspanin family 300735 Ribosomal protein S8 300736 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 300737 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 300738 Protein-tyrosine phosphatase 300739 Eukaryotic porin 300740 Ribosomal protein L11, RNA binding domain 300741 Phosphomannose isomerase type I 300742 MIT domain 300742 Protein kinase domain 300744 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 300745 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 300748 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 300748 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 300757 Glycosyl hydrolase family 20, domain 2. This domain has a zincin-like fold 300757 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold 300758 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 300767 Ribosomal protein S28e 300770 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 300771 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 300774 Cyclophilin type peptidyl-prolyl cis-trans isomerase 300775 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 300778 RNB-like protein. The function of this region of similarity is uncertain 300784 Fibronectin type III domain 300785 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 300789 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 300789 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 300792 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 300794 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 300796 PX domain. PX domains bind to phosphoinositides 300797 Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenyla 300798 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 300799 Protein kinase domain 300800 Ribosomal L15 300807 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 300810 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 300812 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 300812 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 300815 Disintegrin 300815 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 300816 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300817 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 300821 Ribosomal L29 protein 300823 Ribosomal protein S7e 300824 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 300827 Ribosomal protein L24e 300828 Translation initiation factor SUI1 300831 C2 domain 300831 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 300832 Ribosomal protein L34e 300833 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 300835 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 300838 Fimbrial Usher protein. This protein is involved in biogenesis of gram negative bacterial pili 300838 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 300840 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 300841 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron 300844 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 300845 Papain family cysteine protease 300845 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 300850 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 300850 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 300852 Glucose inhibited division protein A 300853 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m 300856 Ribosomal S3Ae family 300857 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo 300860 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 300861 14-3-3 protein 300863 Translationally controlled tumor protein 300867 Ribosomal protein L11, RNA binding domain 300867 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 300868 Proteasome A-type and B-type 300871 Ribosomal protein L31e 300872 ENV polyprotein (coat polyprotein) 300874 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 300876 Stathmin family 300877 Phosphoglucomutase/phosphomannomutase, C-terminal domain 300881 Ribosomal protein S8 300882 Ribosomal protein S19 300884 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 300884 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 300886 5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF i 300888 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 300889 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 300890 Thymidine kinase 300895 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300895 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300898 Ribosomal protein L31e 300899 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved 300900 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved 300901 2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2 300902 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 300903 Ribosomal protein S2 300904 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300904 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300908 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 300913 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 300914 Eukaryotic ribosomal protein L18 300915 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 300920 PMP-22/EMP/MP20/Claudin family 300924 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 300926 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 300928 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 300929 Ribosomal protein L19e 300930 Fibrillarin 300931 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300931 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 300932 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 300933 Metallo-beta-lactamase superfamily 300935 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 300935 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 300939 Hsp90 protein 300940 Calx-beta domain 300940 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 300943 Phosphatidylinositol 3- and 4-kinase 300943 FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal 300945 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 300946 Ribosomal protein L31e 300947 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 300949 Cyclophilin type peptidyl-prolyl cis-trans isomerase 300950 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 300951 Apoptosis regulator proteins, Bcl-2 family 300953 PMP-22/EMP/MP20/Claudin family 300954 HMG (high mobility group) box 300955 SH2 domain 300955 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 300956 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 300957 Stromal antigen (SA/STAG) protein 300959 Ribosomal protein L19e 300960 Protein kinase domain 300960 Fibronectin type III domain 300960 Giardia variant-specific surface protein 300960 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 300960 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 300962 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 300964 Transferrin 300965 ADP-ribosylation factor family 300965 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 300968 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 300969 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 300970 Ribosomal protein S6e 300972 C2 domain 300973 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 300974 Ribosomal protein L3 300975 NUDIX domain 300978 WD domain, G-beta repeat 300979 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 300981 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 300983 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 300985 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 300992 Cache domain 300993 Hyaluronidase 300995 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 300996 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 300997 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 300998 Enolase, N-terminal domain 300998 Enolase, C-terminal TIM barrel domain 300999 Protein kinase domain 300999 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 300999 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 300999 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 301003 tRNA synthetases class I (E and Q), anti-codon binding domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase a 301005 FMN-dependent dehydrogenase 301005 2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor 301005 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 301015 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 301016 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 301017 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 301021 Ribosomal protein L13e 301025 Protein-tyrosine phosphatase 301026 Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins 301027 Trypsin 301027 Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are 301028 Trypsin 301034 Transferrin 301039 Extracellular link domain 301042 B-box zinc finger 301042 Filamin/ABP280 repeat 301042 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 301042 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 301042 NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph 301047 NAD-dependent glycerol-3-phosphate dehydrogenase 301048 Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase 301050 Aminotransferase class-V 301050 Aminotransferase class I and II 301050 Pyridoxal-dependent decarboxylase conserved domain 301050 Herpesvirus transcription activation factor (transactivator). This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses 301051 Pyridoxal-dependent decarboxylase conserved domain 301053 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 301056 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 301057 Villin headpiece domain 301058 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 301059 Death domain 301059 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 301060 Protein kinase domain 301060 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t 301063 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 301066 7 transmembrane receptor (rhodopsin family) 301067 Mitochondrial carrier protein 301068 Translation initiation factor SUI1 301069 Ribosomal protein S19 301073 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 301076 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 301076 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 301077 Ribosomal protein L21e 301078 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 301081 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 301083 Sodium:neurotransmitter symporter family 301084 7 transmembrane receptor (rhodopsin family) 301085 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 301085 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 301086 7 transmembrane receptor (rhodopsin family) 301087 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone 301088 7 transmembrane receptor (rhodopsin family) 301089 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 301090 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 301091 7 transmembrane receptor (rhodopsin family) 301093 7 transmembrane receptor (rhodopsin family) 301096 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 301106 Fibronectin type III domain 301107 Transferrin 301108 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 301108 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 301113 Mitochondrial carrier protein 301114 Mitochondrial carrier protein 301115 Adenylate kinase 301115 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 301117 Clp protease. The Clp protease has an active site catalytic triad. In E. coli Clp protease, ser-111, his-136 and asp-185 form the catalytic triad. One member has lost all of these active site residues and is therefore inactive, others contain one or two l 301120 AMP-binding enzyme 301122 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour 301125 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 301127 Zinc finger, C3HC4 type (RING finger) 301130 Dienelactone hydrolase family 301132 TNF(Tumor Necrosis Factor) family 301133 TNF(Tumor Necrosis Factor) family 301134 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 301134 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301135 HMG (high mobility group) box 301136 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 301137 7 transmembrane receptor (metabotropic glutamate family) 301137 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 301138 7 transmembrane receptor (metabotropic glutamate family) 301139 Hsp90 protein 301140 7 transmembrane receptor (metabotropic glutamate family) 301143 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 301149 7 transmembrane receptor (Secretin family) 301149 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 301149 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 301152 Telomere-binding protein alpha subunit, N-terminal domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Al 301154 Protein kinase domain 301154 7 transmembrane receptor (metabotropic glutamate family) 301154 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 301155 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 301156 Enolase, N-terminal domain 301156 Enolase, C-terminal TIM barrel domain 301158 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 301160 Transaldolase 301160 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 301163 Actin 301164 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 301165 Protein kinase domain 301166 Protein kinase domain 301170 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 301174 Ribosomal L10 301218 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 301221 Radical SAM superfamily 301223 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 301224 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 301225 Death domain 301225 ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function 301231 Ribosomal protein L13 301234 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 301236 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 301237 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 301240 Ribosomal Proteins L2, C-terminal domain 301240 Ribosomal Proteins L2, RNA binding domain 301241 Protein kinase domain 301244 LIM domain. This family represents two copies of the LIM structural domain 301248 Ribosomal protein S5, C-terminal domain 301248 Ribosomal protein S5, N-terminal domain 301249 Ribosomal protein S18 301250 Ribosomal protein L14p/L23e 301251 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 301252 Hsp90 protein 301252 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 301254 tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases 301259 Runt domain 301261 Sulfatase 301261 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 301262 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 301264 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 301265 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 301269 7 transmembrane receptor (Secretin family) 301269 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 301270 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 301271 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 301272 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 301274 Ribosomal protein L21e 301275 B12 binding domain. This domain binds to B12 (adenosylcobamide), it is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase 301276 Methylmalonyl-CoA mutase. The enzyme methylmalonyl-CoA mutase is a member of a class of enzymes that uses coenzyme B12 (adenosylcobalamin) as a cofactor. The enzyme induces the formation of an adenosyl radical from the cofactor. This radical then initiate 301280 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 301281 Cullin family 301282 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 301283 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301284 Transcription factor AP-2 301285 Transcription factor AP-2 301290 Cyclophilin type peptidyl-prolyl cis-trans isomerase 301296 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 301297 Ribosomal L39 protein 301299 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 301300 P53 301303 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 301307 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 301310 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 301312 Thrombospondin N-terminal -like domain 301314 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 301319 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 301319 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 301320 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 301322 Zn-finger in Ran binding protein and others 301323 Eukaryotic-type DNA primase, large subunit. DNA primase is the polymerase that synthesises small RNA primers for the Okazaki fragments made during discontinuous DNA replication. DNA primase is a heterodimer of two subunits, the small subunit Pri1 (48 kDa 301324 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 301325 BAG domain. Domain present in Hsp70 regulators 301327 Ribosomal protein L36e 301328 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 301329 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 301333 Protein-tyrosine phosphatase 301336 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 301338 Ribosomal protein S11 301339 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 301341 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 301343 Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-2 301346 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 301346 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 301352 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 301353 Transglycosylase SLT domain. This family is distantly related to pfam00062 301360 Zinc finger, C3HC4 type (RING finger) 301360 ATP-dependent protease La (LON) domain 301361 Zinc finger, C3HC4 type (RING finger) 301361 ATP-dependent protease La (LON) domain 301363 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 301364 Core histone H2A/H2B/H3/H4 301365 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 301366 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 301371 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 301372 7 transmembrane receptor (rhodopsin family) 301379 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 301381 Dienelactone hydrolase family 301384 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 301387 Ribosomal protein S12 301388 LacY proton/sugar symporter. This family is closely related to the sugar transporter family 301390 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 301390 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 301392 Ribosomal protein L11, RNA binding domain 301393 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 301395 Ribosomal protein S7e 301399 Hsp90 protein 301401 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 301405 Enolase, N-terminal domain 301405 Enolase, C-terminal TIM barrel domain 301407 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 301408 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 301412 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 301414 Ribosomal protein L31e 301415 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 301416 Uncharacterised protein family (UPF0083) 301417 Ribosomal protein L6e 301424 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 301424 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301426 Protein kinase domain 301429 ICE-like protease (caspase) p20 domain 301430 Origin recognition complex subunit 2. All DNA replication initiation is driven by a single conserved eukaryotic initiator complex termed he origin recognition complex (ORC). The ORC is a six protein complex. The function of ORC is reviewed in 301431 Domain of unknown function DUF34. One member of this family NIF3 (NGG1p interacting factor 3) interacts with the yeast transcriptional coactivator NGG1p which is part of the ADA complex the exact function of this interaction is unknown 301432 Cyclophilin type peptidyl-prolyl cis-trans isomerase 301433 Protein kinase domain 301434 Protein kinase domain 301435 [2Fe-2S] binding domain 301435 CO dehydrogenase flavoprotein C-terminal domain 301435 FAD binding domain in molybdopterin dehydrogenase 301435 Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain 301435 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 301436 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 301438 Ribosomal protein S5, C-terminal domain 301438 Ribosomal protein S5, N-terminal domain 301439 Protein kinase domain 301440 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 301440 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are 301442 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 301444 Acyl CoA binding protein 301445 WD domain, G-beta repeat 301447 Beige/BEACH domain 301448 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 301449 Ribosomal protein S5, C-terminal domain 301450 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 301455 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 301458 Molybdopterin oxidoreductase 301459 Cytochrome c oxidase subunit Vb 301461 Disintegrin 301461 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 301462 Ribosomal L29e protein family 301468 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 301469 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 301471 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 301472 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 301475 Enolase, N-terminal domain 301475 Enolase, C-terminal TIM barrel domain 301484 Microtubule associated protein (MAP65/ASE1 family) 301484 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301485 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 301486 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 301488 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 301489 SCP-like extracellular protein. This domain is also found in prokaryotes 301490 ENV polyprotein (coat polyprotein) 301491 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301492 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 301494 Mitosis protein DIM1 301495 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 301496 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 301496 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301499 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 301499 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 301501 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 301501 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 301502 Mitosis protein DIM1 301502 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 301507 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 301510 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 301510 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 301511 Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p4 301512 WD domain, G-beta repeat 301513 Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes 301514 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 301517 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 301517 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 301518 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 301518 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 301519 Cyclophilin type peptidyl-prolyl cis-trans isomerase 301520 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 301522 ATP synthase subunit C 301525 Glycosyl hydrolases family 35 301526 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 301527 Protein-tyrosine phosphatase 301528 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 301529 Aminopeptidase I zinc metalloprotease (M18) 301530 Aminopeptidase I zinc metalloprotease (M18) 301531 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 301532 Protein kinase domain 301533 Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars 301534 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 301539 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 301544 tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits 301546 Ribosomal protein S2 301547 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 301549 WD domain, G-beta repeat 301549 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 301553 Ribosomal protein S5, C-terminal domain 301553 Ribosomal protein S5, N-terminal domain 301555 Cullin family 301568 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 301570 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 301571 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 301572 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 301573 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 301574 Mo25 protein family 301575 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 301576 7 transmembrane receptor (rhodopsin family) 301577 Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding 301582 Ribosomal protein L21e 301584 Alkaline phosphatase 301584 Ribosomal protein L31e 301585 Alkaline phosphatase 301586 Alkaline phosphatase 301587 Alkaline phosphatase 301589 Eukaryotic initiation factor 4E 301591 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 301592 Zinc-binding dehydrogenase 301593 UDP-glucoronosyl and UDP-glucosyl transferase 301594 UDP-glucoronosyl and UDP-glucosyl transferase 301595 UDP-glucoronosyl and UDP-glucosyl transferase 301595 UDP-glucoronosyl and UDP-glucosyl transferase 301595 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 301597 Cyclophilin type peptidyl-prolyl cis-trans isomerase 301600 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 301600 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301604 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 301604 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 301607 14-3-3 protein 301608 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 301609 Ribosomal protein L36e 301610 7 transmembrane receptor (rhodopsin family) 301611 7 transmembrane receptor (rhodopsin family) 301612 7 transmembrane receptor (rhodopsin family) 301613 7 transmembrane receptor (rhodopsin family) 301614 7 transmembrane receptor (rhodopsin family) 301615 7 transmembrane receptor (rhodopsin family) 301619 PH domain. PH stands for pleckstrin homology 301622 Thymidylate kinase 301624 pfam02913, FAD-oxidase_C, FAD linked oxidases, C-terminal domain. This domain has a ferredoxin-like fold 301626 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 301627 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 301628 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301630 S-adenosyl-L-homocysteine hydrolase 301631 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 301632 S25 ribosomal protein 301633 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 301635 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 301637 Proteasome A-type and B-type 301638 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 301639 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 301640 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 301642 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 301643 Translationally controlled tumor protein 301645 Core histone H2A/H2B/H3/H4 301646 ENV polyprotein (coat polyprotein) 301650 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 301653 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 301657 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 301657 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 301658 Protein kinase domain 301663 Ribosomal S17 301664 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 301670 SH2 domain 301671 SH2 domain 301672 Ribosomal protein L31e 301673 DNA mismatch repair proteins, mutS family 301674 F-box domain 301674 Putative zinc finger in N-recognin 301677 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 301677 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301678 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are repr 301679 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 301679 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 301681 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301683 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 301685 Enolase, C-terminal TIM barrel domain 301685 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 301687 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 301690 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 301690 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301691 Ribosomal protein S7e 301693 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 301694 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 301704 Ribosomal protein L24e 301706 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 301707 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 301708 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 301709 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 301710 Ribosomal L18ae protein family 301713 MAM domain. An extracellular domain found in many receptors 301715 Fibronectin type III domain 301715 MAM domain. An extracellular domain found in many receptors 301719 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301720 Protein kinase domain 301722 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301723 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301724 Protein kinase domain 301726 Protein kinase domain 301728 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 301730 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 301731 Eukaryotic ribosomal protein L18 301732 Ribosomal protein L19e 301735 Enolase, N-terminal domain 301735 Enolase, C-terminal TIM barrel domain 301737 Protein kinase domain 301750 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 301753 impB/mucB/samB family. These proteins are involved in UV protection 301754 Resolvase, N terminal domain. The N-terminal domain of the resolvase family (this family) contains the active site and the dimer interface. The extended arm at the C-terminus of this domain connects to the C-terminal helix-turn-helix domain of resolvase - 301755 Transposase, Mutator family 301756 Conserved hypothetical protein 95 301756 ubiE/COQ5 methyltransferase family 301756 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple 301756 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 301759 HMG (high mobility group) box 301761 Ribosomal protein S7e 301764 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 301766 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 301767 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 301768 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301771 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301772 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301773 Ribosomal protein L19e 301774 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301775 Ribosomal protein L19e 301777 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301778 Ribosomal protein L19e 301779 Ribosomal protein L19e 301780 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301785 WD domain, G-beta repeat 301786 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301788 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301792 ADP-ribosylation factor family 301792 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 301793 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 301794 7 transmembrane receptor (rhodopsin family) 301795 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 301796 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 301797 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 301797 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 301799 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 301799 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 301801 Ribosomal protein L19e 301802 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301803 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301807 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301808 Ribosomal protein L19e 301810 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301813 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301814 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301816 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301818 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301819 7 transmembrane receptor (rhodopsin family) 301822 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301824 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301827 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301830 Ribosomal protein L19e 301831 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301832 Ribosomal protein L19e 301833 Ribosomal protein L19e 301835 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301839 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301840 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301841 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301842 Ribosomal protein L19e 301843 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301844 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301848 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301852 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301853 Ribosomal protein L19e 301854 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301856 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301857 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301860 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301862 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301866 Ribosomal protein L19e 301867 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301868 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301869 Ribosomal protein L19e 301870 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301872 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301873 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301876 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301877 Glutaredoxin 301878 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301879 Ribosomal protein L19e 301880 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301881 Ribosomal protein L19e 301882 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301883 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301885 Uncharacterized ACR, COG1579 301885 Intermediate filament protein 301885 7 transmembrane receptor (rhodopsin family) 301885 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 301885 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 301885 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 301886 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301887 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 301890 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301891 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301893 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 301894 7 transmembrane receptor (metabotropic glutamate family) 301894 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 301897 Protein kinase domain 301898 Protein kinase domain 301899 Protein kinase domain 301900 Protein kinase domain 301901 Protein kinase domain 301902 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 301903 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 301910 WD domain, G-beta repeat 301911 ADP-ribosylation factor family 301911 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 301912 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301913 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301914 Ribosomal protein L19e 301915 Ribosomal protein L19e 301918 Ribosomal protein L19e 301923 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301925 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301926 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301928 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301929 Ribosomal protein L19e 301930 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301931 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301932 Ribosomal protein L19e 301932 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301934 Ribosomal protein L19e 301934 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301935 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301943 Ribosomal protein L19e 301944 Ribosomal protein L19e 301944 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301945 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301947 Ribosomal protein L19e 301947 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301948 Intermediate filament protein 301949 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301950 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301951 Ribosomal protein L19e 301953 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 301961 ENV polyprotein (coat polyprotein) 301962 Hsp90 protein 301963 Hsp90 protein 301966 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301967 Ribosomal protein L19e 301968 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301969 Ribosomal protein L19e 301970 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 301971 Kinesin motor domain 301981 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 301989 Glutaredoxin 301990 Glutaredoxin 301991 Glutaredoxin 301992 Glutaredoxin 301995 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 302000 Glutaredoxin 302002 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302003 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302005 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302006 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302007 7 transmembrane receptor (rhodopsin family) 302009 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302011 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302012 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302013 Ribosomal protein L19e 302017 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302018 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302021 Mitosis protein DIM1 302021 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 302023 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 302024 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 302025 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 302026 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 302027 7 transmembrane receptor (rhodopsin family) 302028 7 transmembrane receptor (metabotropic glutamate family) 302028 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 302033 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302033 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302034 S-adenosyl-L-homocysteine hydrolase 302035 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 302040 7 transmembrane receptor (rhodopsin family) 302041 7 transmembrane receptor (rhodopsin family) 302043 7 transmembrane receptor (rhodopsin family) 302044 7 transmembrane receptor (rhodopsin family) 302045 7 transmembrane receptor (rhodopsin family) 302046 7 transmembrane receptor (rhodopsin family) 302047 7 transmembrane receptor (rhodopsin family) 302048 7 transmembrane receptor (rhodopsin family) 302048 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 302052 7 transmembrane receptor (rhodopsin family) 302053 7 transmembrane receptor (rhodopsin family) 302054 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 302055 7 transmembrane receptor (rhodopsin family) 302056 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 302060 Protein kinase domain 302061 Protein kinase domain 302061 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 302062 Protein kinase domain 302063 Protein kinase domain 302063 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 302064 Protein kinase domain 302064 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 302066 Protein kinase domain 302066 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 302070 Protein kinase domain 302071 Protein kinase domain 302072 Protein kinase domain 302073 Protein kinase domain 302074 Protein kinase domain 302075 Protein kinase domain 302076 Protein kinase domain 302086 7 transmembrane receptor (rhodopsin family) 302087 7 transmembrane receptor (rhodopsin family) 302087 Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production 302089 Myosin head (motor domain) 302097 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302103 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302104 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 302104 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302104 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 302105 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302106 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302107 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302109 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302110 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 302110 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302110 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 302111 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302113 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302114 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302115 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302116 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302117 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302118 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302119 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302120 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302121 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302123 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302123 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302124 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 302124 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302125 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302126 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302127 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302128 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302129 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302131 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302132 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302133 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 302135 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302136 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302137 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302138 Ribosomal protein L19e 302139 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302140 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302141 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302142 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302143 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302146 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302147 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302149 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302152 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302153 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302154 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302156 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302157 Ribosomal protein L19e 302158 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302159 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302160 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302162 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302163 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302166 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 302167 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302167 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302168 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 302170 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 302179 ADP-ribosylation factor family 302179 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 302183 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 302183 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 302205 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 302213 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302217 ENV polyprotein (coat polyprotein) 302218 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302220 7 transmembrane receptor (rhodopsin family) 302221 ENV polyprotein (coat polyprotein) 302224 ADP-ribosylation factor family 302224 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 302226 Ribosomal protein L19e 302227 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302228 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302229 Ribosomal protein L19e 302230 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302231 Ribosomal protein L19e 302232 Glutaredoxin 302233 Hsp90 protein 302235 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302236 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302237 Ribosomal protein L19e 302240 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302240 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302241 Hsp90 protein 302250 Ribosomal protein L19e 302251 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302254 7 transmembrane receptor (rhodopsin family) 302255 7 transmembrane receptor (rhodopsin family) 302259 Protein kinase domain 302260 Protein kinase domain 302261 Protein kinase domain 302262 Protein kinase domain 302263 Protein kinase domain 302267 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302270 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302272 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 302273 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 302274 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 302275 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 302276 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 302279 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 302282 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 302287 Ribosomal protein S5, C-terminal domain 302288 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 302289 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 302290 Clathrin adaptor complex small chain 302290 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 302291 ADP-ribosylation factor family 302291 Clathrin adaptor complex small chain 302291 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 302292 Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair 302293 ADP-ribosylation factor family 302293 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 302294 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302294 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302296 Galactoside-binding lectin 302299 Actin 302300 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 302303 Actin 302303 Regulator of chromosome condensation (RCC1) 302304 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 302306 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 302310 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 302313 Tetraspanin family 302314 Hsp90 protein 302315 ENV polyprotein (coat polyprotein) 302316 Fructose-bisphosphate aldolase class-I 302317 HMG (high mobility group) box 302318 Phosphorylase family 2 302319 Ribosomal protein S2 302321 HMG (high mobility group) box 302324 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 302325 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 302326 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 302329 Nucleoside diphosphate kinase 302330 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 302333 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 302339 Fibrillarin 302340 Actin 302345 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302345 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302349 ENV polyprotein (coat polyprotein) 302351 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302351 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302352 Homeobox domain 302356 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 302357 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 302360 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 302360 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 302366 Zinc finger, C3HC4 type (RING finger) 302369 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 302370 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 302371 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 302372 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 302373 7 transmembrane receptor (rhodopsin family) 302374 Enolase, C-terminal TIM barrel domain 302375 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 302376 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 302377 7 transmembrane receptor (rhodopsin family) 302378 7 transmembrane receptor (rhodopsin family) 302380 Ribosomal protein L21e 302381 Galactoside-binding lectin 302382 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302384 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 302387 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302387 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302388 Ribosomal protein L19e 302389 HMG (high mobility group) box 302391 Ribosomal protein L19e 302392 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302393 Ribosomal protein S8e 302394 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302395 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 302395 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 302398 Mitochondrial carrier protein 302400 Homeobox domain 302402 Ribosomal L18ae protein family 302404 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 302405 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302406 Ribosomal protein S7e 302408 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302409 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 302410 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 302411 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 302412 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 302415 Fork head domain 302416 Ribosomal protein S5, C-terminal domain 302422 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 302423 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 302425 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 302426 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 302426 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 302428 Ribosomal L29e protein family 302429 Proliferating cell nuclear antigen, N-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 302431 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302432 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302433 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302434 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302435 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302437 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 302438 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302439 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 302439 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 302440 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302441 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302442 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302445 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 302453 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 302454 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 302456 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 302460 Intermediate filament protein 302460 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 302462 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 302465 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 302466 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 302467 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 302468 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 302469 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 302472 NOL1/NOP2/sun family 302473 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 302477 'chromo' (CHRromatin Organization MOdifier) domain 302478 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 302479 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 302485 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 302487 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302488 Ubiquitin carboxyl-terminal hydrolase family 2 302488 Ubiquitin carboxyl-terminal hydrolases family 2 302494 ATP synthase 302495 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 302495 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 302496 Actin 302497 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 302500 PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi 302501 Cullin family 302501 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 302502 Cullin family 302506 Quinolinate phosphoribosyl transferase, N-terminal domain. Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase EC:2.4.2.19 is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses 302508 TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme 302509 Proteasome A-type and B-type 302510 Cyclophilin type peptidyl-prolyl cis-trans isomerase 302512 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 302514 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in 302515 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302516 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 302517 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 302518 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302518 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302521 ENV polyprotein (coat polyprotein) 302522 Cyclophilin type peptidyl-prolyl cis-trans isomerase 302523 Ribosomal protein S27 302525 Ribosomal protein L19e 302528 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 302530 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 302532 Ribosomal protein S6e 302533 Cyclophilin type peptidyl-prolyl cis-trans isomerase 302535 Elongation factor 1 gamma, conserved domain 302535 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 302536 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302538 C2 domain 302538 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 302538 C2 domain 302538 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 302539 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 302541 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 302543 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302546 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302547 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 302547 HpcH/HpaI aldolase family. This family includes 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase and 4-hydroxy-2-oxovalerate aldolase 302547 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 302548 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302549 ADP-ribosylation factor family 302549 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 302550 Protein kinase domain 302551 Ribosomal protein L31e 302552 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 302553 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 302554 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 302555 Zn-finger in ubiquitin-hydrolases and other protein 302556 ADP-ribosylation factor family 302556 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 302556 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 302557 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 302558 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 302561 Cyclophilin type peptidyl-prolyl cis-trans isomerase 302564 Ribosomal protein L3 302566 Ubiquitin carboxyl-terminal hydrolase family 2 302566 Ubiquitin carboxyl-terminal hydrolases family 2 302567 LIM domain. This family represents two copies of the LIM structural domain 302568 NOL1/NOP2/sun family 302568 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 302571 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302572 Core histone H2A/H2B/H3/H4 302574 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 302575 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 302576 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 302578 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 302579 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 302579 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 302579 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 302583 Fork head domain 302585 Apoptosis regulator proteins, Bcl-2 family 302586 Regulator of chromosome condensation (RCC1) 302588 Thymosin beta-4 family 302589 ENV polyprotein (coat polyprotein) 302590 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 302591 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 302591 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle 302592 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 302595 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 302595 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 302601 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 302602 Ribosomal protein L19e 302605 PH domain. PH stands for pleckstrin homology 302606 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 302607 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 302610 Enolase, C-terminal TIM barrel domain 302610 Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain 302611 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 302613 Ribosomal protein L31e 302614 Phosphatidylinositol 3- and 4-kinase 302615 Ribosomal protein L35Ae 302617 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 302619 Ocular albinism type 1 protein 302619 7 transmembrane receptor (Secretin family) 302620 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 302625 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 302626 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 302630 F-actin capping protein alpha subunit 302635 7 transmembrane receptor (rhodopsin family) 302636 Glutaredoxin 302637 Glutaredoxin 302638 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302639 Zinc finger, C3HC4 type (RING finger) 302640 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-Co 302642 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 302643 Octicosapeptide repeat. Short motif that may bind Ca2+ 302645 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302646 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302647 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 302650 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302655 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 302658 'Cold-shock' DNA-binding domain 302659 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 302659 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 302660 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 302666 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 302667 PH domain. PH stands for pleckstrin homology 302667 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 302668 Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid 302669 Eukaryotic-type carbonic anhydrase 302670 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302671 Clathrin adaptor complex small chain 302671 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 302674 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 302675 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302677 Glutamine amidotransferase class-I 302677 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 302677 Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold 302678 BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function 302679 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 302681 Proteasome A-type and B-type 302682 Mitochondrial carrier protein 302685 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302686 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 302686 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 302688 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 302690 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302690 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302694 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 302694 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 302695 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 302697 Eukaryotic initiation factor 1A 302698 Ribosomal L29e protein family 302699 Hsp90 protein 302702 Nucleoside diphosphate kinase 302703 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 302703 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 302705 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 302706 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 302707 Ribosomal protein S24e 302708 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 302710 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302711 WD domain, G-beta repeat 302713 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 302714 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 302715 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 302716 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 302719 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302720 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 302721 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 302721 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 302722 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302723 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302725 SKIP/SNW domain. This domain is found in chromatin proteins 302728 Ribosomal L10 302729 Death domain 302740 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 302742 HMG (high mobility group) box 302743 Actin 302744 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302745 Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate 302746 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302747 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302748 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302749 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302750 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 302753 GDP dissociation inhibitor 302755 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 302757 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 302760 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302761 Ribosomal protein L44 302762 WD domain, G-beta repeat 302763 Actin 302766 HMG (high mobility group) box 302768 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302769 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 302771 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 302773 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 302774 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 302775 Zinc finger, C3HC4 type (RING finger) 302779 ENV polyprotein (coat polyprotein) 302780 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302781 ENV polyprotein (coat polyprotein) 302782 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302782 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302784 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 302785 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 302786 Hsp90 protein 302787 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 302788 HMG (high mobility group) box 302789 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 302789 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 302790 Hsp90 protein 302791 Hsp90 protein 302792 HMG (high mobility group) box 302793 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 302793 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 302794 ENV polyprotein (coat polyprotein) 302795 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302795 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302796 Actin 302797 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 302798 HMG (high mobility group) box 302799 HMG (high mobility group) box 302800 Ribosomal S17 302801 HMG (high mobility group) box 302803 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 302804 Phosphomannose isomerase type I 302804 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 302806 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302806 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302807 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 302808 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 302809 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 302809 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 302811 Ets-domain 302813 7 transmembrane receptor (rhodopsin family) 302813 7 transmembrane receptor (rhodopsin family) 302816 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302816 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 302817 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302819 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 302820 7 transmembrane receptor (rhodopsin family) 302821 7 transmembrane receptor (rhodopsin family) 302822 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv 302822 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 302823 7 transmembrane receptor (rhodopsin family) 302824 7 transmembrane receptor (rhodopsin family) 302825 7 transmembrane receptor (rhodopsin family) 302826 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 302827 Actin 302829 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302831 Eukaryotic porin 302835 Tropomyosin 302836 Ribosomal protein S2 302838 Eukaryotic porin 302840 HMG (high mobility group) box 302842 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 302842 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 302845 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 302846 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 302847 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 302848 NAC domain 302851 Microtubule associated protein (MAP65/ASE1 family) 302852 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 302855 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302856 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 302856 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 302858 Core histone H2A/H2B/H3/H4 302859 Hsp90 protein 302862 NAC domain 302862 Ribosomal protein L19e 302865 Clathrin adaptor complex small chain 302867 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 302869 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 302870 Ribosomal protein S8e 302871 Oxidoreductase FAD-binding domain 302871 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 302872 Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372 302873 Zinc finger, C3HC4 type (RING finger) 302876 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 302879 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302880 WD domain, G-beta repeat 302881 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 302886 Endomembrane protein 70 302891 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 302893 PX domain. PX domains bind to phosphoinositides 302898 Putative replicase 1 (ORF2) 302899 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 302903 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 302904 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 302914 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 302915 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 302918 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 302920 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302931 PWI domain 302931 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 302937 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 302937 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 302939 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 302940 Ribosomal protein L10 302943 Ribosomal protein L6 302944 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 302945 Fibronectin type III domain 302950 Adenylate and Guanylate cyclase catalytic domain 302952 Glutathione peroxidase 302953 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 302954 7 transmembrane receptor (rhodopsin family) 302957 7 transmembrane receptor (rhodopsin family) 302958 7 transmembrane receptor (rhodopsin family) 302961 Ribosomal protein S5, C-terminal domain 302961 Ribosomal protein S5, N-terminal domain 302962 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 302963 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 302963 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 302963 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 302967 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 302970 Trypsin 302971 Trypsin 302971 Ribosomal L39 protein 302972 N-acetylglucosamine-6-phosphate deacetylase. This family are predicted to adopt a TIM barrel fold. This is family of N-acetylglucosamine-6-phosphate deacetylases, EC:3.5.1.25 These enzymes catalyse the reaction N-acetyl-D-glucosamine 6-phosphate + H2O <=> 302975 Synaptogyrin. This family of proteins is distantly related to pfam01284 302976 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 302978 Sulfotransferase protein 302980 Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the tyr 302981 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 302983 Uncharacterized ACR, COG1579 302983 Intermediate filament protein 302983 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 302983 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 302988 C2 domain 302989 Trypsin 302990 Trypsin 302991 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 302993 HMG (high mobility group) box 302995 Metallo-beta-lactamase superfamily 303001 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 303002 Ribosomal protein L3 303003 Nucleoside diphosphate kinase 303004 Per1-like. A member of this family has been implemented in protein processing in the endoplasmic reticulum 303005 Protein of unknown function, DUF259 303007 Globin 303008 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 303021 PX domain. PX domains bind to phosphoinositides 303021 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 303023 DAD family. Members of this family are thought to be integral membrane proteins. Some members of this family have been shown to cause apoptosis if mutated, these proteins are known as DAD for defender against death. The family also includes the epsilon su 303024 F-box domain 303024 WD domain, G-beta repeat 303025 Ribosomal protein L6 303029 Importin-beta N-terminal domain 303031 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 303032 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 303033 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 303039 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 303039 Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA re 303040 Actin 303049 HMG (high mobility group) box 303051 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 303051 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303053 Ribosomal protein S19e 303054 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 303058 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 303059 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 303059 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 303060 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 303068 HMG (high mobility group) box 303083 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 303086 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 303088 B-box zinc finger 303088 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 303089 B-box zinc finger 303089 Zinc finger, C3HC4 type (RING finger) 303089 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 303090 Protein of unknown function, DUF258 303091 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 303093 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 303096 7 transmembrane receptor (rhodopsin family) 303105 7 transmembrane receptor (rhodopsin family) 303107 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303113 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 303115 Ribosomal L10 303129 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 303130 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 303133 pfam02939, UcrQ, UcrQ family. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This family represents the 9.5 kDa subunit of the complex 303134 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 303135 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 303136 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303140 Sugar (and other) transporter 303141 Cyclic nucleotide-binding domain 303141 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 303141 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 303141 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 303145 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 303146 HIT family 303148 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 303159 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M 303160 WD domain, G-beta repeat 303162 Ribosomal protein S8 303166 Phospholipase A2 inhibitor 303167 B-box zinc finger 303167 Zinc finger, C3HC4 type (RING finger) 303167 7 transmembrane receptor (rhodopsin family) 303167 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 303168 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 303168 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 303170 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 303170 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 303170 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303171 7 transmembrane receptor (rhodopsin family) 303171 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303172 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303173 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303175 Core histone H2A/H2B/H3/H4 303175 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 303176 Core histone H2A/H2B/H3/H4 303176 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 303177 Core histone H2A/H2B/H3/H4 303178 B-box zinc finger 303178 Zinc finger, C3HC4 type (RING finger) 303178 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 303179 Connexin 303181 wnt family 303182 wnt family 303189 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 303189 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303191 Myosin head (motor domain) 303192 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 303194 Topoisomerase DNA binding C4 zinc finger 303195 NAC domain 303201 Ubiquitin carboxyl-terminal hydrolase family 2 303201 Ubiquitin carboxyl-terminal hydrolases family 2 303201 Zn-finger in ubiquitin-hydrolases and other protein 303202 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 303203 Uncharacterized membrane protein family UPF0013. These proteins are integral membrane proteins of unknown function 303205 Sodium:solute symporter family 303206 Protein kinase domain 303209 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 303212 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 303212 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303213 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303213 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 303214 B-box zinc finger 303214 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 303215 WD domain, G-beta repeat 303215 High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin 303218 Sulfotransferase protein 303220 Sulfotransferase protein 303222 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 303226 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 303226 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303226 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 303228 Ribosomal protein S5, C-terminal domain 303228 Ribosomal protein S5, N-terminal domain 303229 Enolase, N-terminal domain 303229 Enolase, C-terminal TIM barrel domain 303233 Ubiquitin carboxyl-terminal hydrolase family 2 303233 Ubiquitin carboxyl-terminal hydrolases family 2 303235 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303236 PMP-22/EMP/MP20/Claudin family 303237 Mitochondrial carrier protein 303241 'chromo' (CHRromatin Organization MOdifier) domain 303241 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 303241 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 303241 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 303243 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 303244 PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. Howe 303245 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 303247 Protein kinase domain 303252 Lipoxygenase 303252 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 303257 Vitelline membrane outer layer protein I (VOMI). VOMI binds tightly to ovomucin fibrils of the egg yolk membrane. The structure that consists of three beta-sheets forming Greek key motifs, which are related by an internal pseudo three-fold symmetry. Furth 303260 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 303270 Ribosomal protein L15 303272 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 303272 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 303274 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 303275 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 303276 GTPase of unknown function 303284 Ribosomal protein L23 303284 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 303285 TRAF-type zinc finger 303285 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 303287 F-box domain 303289 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 303290 Gamma-glutamyltranspeptidase 303291 Sugar (and other) transporter 303291 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 303293 Ribosomal protein S27 303293 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 303294 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 303296 7 transmembrane receptor (rhodopsin family) 303297 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 303298 Rap/ran-GAP 303304 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 303304 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 303306 Pyridoxal-phosphate dependent enzyme. Members of this family are all pyridoxal-phosphate dependent enzymes. This family includes: serine dehydratase EC:4.2.1.13 P20132, threonine dehydratase EC:4.2.1.16, tryptophan synthase beta chain EC:4.2.1.20, threoni 303307 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 303308 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 303310 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 303312 Beige/BEACH domain 303312 WD domain, G-beta repeat 303314 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 303321 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 303322 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 303323 Ribosomal protein S2 303328 Protein kinase domain 303329 Protein kinase domain 303336 WD domain, G-beta repeat 303337 Intermediate filament protein 303337 Microtubule associated protein (MAP65/ASE1 family) 303337 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 303337 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 303337 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 303340 Stathmin family 303341 Intermediate filament protein 303342 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 303346 Caldesmon 303347 Helix-loop-helix DNA-binding domain 303349 Nucleoside diphosphate kinase 303349 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 303350 Zinc finger, C3HC4 type (RING finger) 303350 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 303353 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 303358 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 303361 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 303361 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303371 Fibronectin type III domain 303372 WD domain, G-beta repeat 303374 Transaldolase 303376 Fibronectin type III domain 303377 NB-ARC domain 303377 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 303379 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 303379 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303382 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 303384 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 303384 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 303385 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 303388 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 303389 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 303391 Myosin head (motor domain) 303391 IQ calmodulin-binding motif. Calmodulin-binding motif 303394 Ubiquitin carboxyl-terminal hydrolase family 2 303394 Ubiquitin carboxyl-terminal hydrolases family 2 303398 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 303399 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 303405 Yippee putative zinc-binding protein 303407 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 303408 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 303413 Animal haem peroxidase 303414 Animal haem peroxidase 303415 7 transmembrane receptor (rhodopsin family) 303419 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 303439 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 303448 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 303448 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303449 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 303454 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 303455 Eukaryotic-type carbonic anhydrase 303462 AMP-binding enzyme 303463 AMP-binding enzyme 303466 Cyclophilin type peptidyl-prolyl cis-trans isomerase 303467 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 303469 Homeobox domain 303470 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 303470 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 303470 MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases 303471 Stathmin family 303476 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 303477 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 303477 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 303478 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 303479 Herpesvirus transcription activation factor (transactivator). This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses 303479 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 303479 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 303480 Homeobox domain 303484 Homeobox domain 303485 Homeobox domain 303486 Homeobox domain 303488 Homeobox domain 303489 Homeobox domain 303490 Homeobox domain 303491 Homeobox domain 303493 PX domain. PX domains bind to phosphoinositides 303494 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 303495 Clathrin adaptor complex small chain 303496 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 303497 Oxysterol-binding protein 303497 PH domain. PH stands for pleckstrin homology 303499 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 303500 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 303501 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 303505 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 303506 ADP-ribosylation factor family 303506 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 303506 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 303516 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 303518 Intermediate filament protein 303519 Intermediate filament protein 303520 Intermediate filament protein 303521 Intermediate filament protein 303522 Intermediate filament protein 303522 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 303523 Intermediate filament protein 303523 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 303526 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 303527 Intermediate filament protein 303527 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 303528 Intermediate filament protein 303529 Intermediate filament protein 303529 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 303530 Intermediate filament protein 303530 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 303531 Intermediate filament protein 303531 Flagellar hook-associated protein 2. The flagellar hook-associated protein 2 (HAP2 or FliD) forms the distal end of the flagella, and plays a role in mucin specific adhesion of the bacteria 303531 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 303533 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 303533 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 303535 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 303538 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 303538 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 303540 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 303540 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 303541 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 303541 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 303542 GH3 auxin-responsive promoter 303549 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 303550 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 303550 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 303550 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge 303552 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 303553 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 303554 Octicosapeptide repeat. Short motif that may bind Ca2+ 303554 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 303557 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 303559 ADP-ribosylation factor family 303559 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 303559 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 303561 L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7 303561 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 303561 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 303562 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 303565 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 303567 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 303569 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 303571 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 303575 Kinesin motor domain 303576 Ribosomal L22e protein family 303577 Acyl CoA binding protein 303583 PH domain. PH stands for pleckstrin homology 303583 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 303583 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 303584 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 303584 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 303586 wnt family 303600 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 303601 Cytochrome b561 303604 Protein kinase domain 303605 Protein kinase domain 303605 LIM domain. This family represents two copies of the LIM structural domain 303607 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 303607 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 303612 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 303614 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 303615 C2 domain 303615 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 303618 Phosphatidylinositol transfer protein 303623 Protein kinase domain 303623 Protein kinase C terminal domain 303624 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 303629 Macrophage migration inhibitory factor (MIF) 303631 Sulfatase 303632 Sulfatase 303632 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 303634 ADP-ribosylation factor family 303634 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 303637 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 303639 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 303651 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 303658 Ribosomal L38e protein family 303659 Kinesin motor domain 303660 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 303661 7 transmembrane receptor (rhodopsin family) 303663 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 303665 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 303665 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 303666 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 303667 Mitochondrial carrier protein 303668 ADP-ribosylation factor family 303668 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 303669 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 303670 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 303671 Fatty acid desaturase 303673 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons 303674 Sugar (and other) transporter 303676 Mitochondrial carrier protein 303676 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 303678 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 303678 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 303678 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 303682 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 303683 B-box zinc finger 303683 Zinc finger, C3HC4 type (RING finger) 303683 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 303684 Zinc finger, C3HC4 type (RING finger) 303684 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 303685 Phosphatidylethanolamine-binding protein 303687 Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen 303689 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 303690 Domain of unknown function DUF20. This transmembrane region is found in putative permeases and predicted transmembrane proteins it has no known function. It is not clear what source suggested that these proteins may be permeases and this information shoul 303690 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 303692 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 303694 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 303694 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 303700 Ubiquitin carboxyl-terminal hydrolase family 2 303701 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 303701 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 303702 Glycosyl hydrolase family 85. Family of endo-beta-N-acetylglucosaminidases. These enzymes work on a broad spectrum of substrates 303703 Myosin head (motor domain) 303708 Myosin head (motor domain) 303711 Myosin head (motor domain) 303713 Myosin head (motor domain) 303717 Myosin head (motor domain) 303718 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 303718 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 303723 Myosin head (motor domain) 303725 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 303728 Myosin head (motor domain) 303729 Sulfatase 303729 Metalloenzyme superfamily. This family includes phosphopentomutase and 2,3-bisphosphoglycerate-independent phosphoglycerate mutase. This family is also related to pfam00245. The alignment contains the most conserved residues that are probably involved in 303729 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 303730 'chromo' (CHRromatin Organization MOdifier) domain 303731 'chromo' (CHRromatin Organization MOdifier) domain 303732 'chromo' (CHRromatin Organization MOdifier) domain 303734 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 303736 Zinc finger, C3HC4 type (RING finger) 303738 RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits 303743 Pectinacetylesterase 303745 Sir2 family 303750 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold 303751 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 303752 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 303753 Fork head domain 303753 pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain 303754 ADP-ribosylation factor family 303754 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 303755 Fructosamine kinase. This family includes eukaryotic fructosamine-3-kinase enzymes. The family also includes bacterial members that have not been characterised but probably have a similar or identical function 303755 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 303760 SH2 domain 303761 7 transmembrane receptor (metabotropic glutamate family) 303761 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 303762 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303770 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 303772 Sugar (and other) transporter 303773 jmjC domain 303775 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 303777 AMP-binding enzyme 303778 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 303778 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303781 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 303784 Yippee putative zinc-binding protein 303785 Fibronectin type III domain 303785 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 303786 Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines 303786 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 303787 Amino acid permease 303788 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyne 303789 Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyne 303792 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 303793 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 303795 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 303796 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 303798 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 303801 Lysosome-associated membrane glycoprotein (Lamp) 303803 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 303809 Ribosomal protein S7e 303810 Ribosomal L15 303811 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 303811 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 303811 Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778) 303811 DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The D 303814 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 303815 Ribosomal protein L11, RNA binding domain 303815 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 303816 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 303816 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 303818 NUDIX domain 303823 Protein kinase domain 303824 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 303826 Ribosomal protein L15 303827 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 303827 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 303828 Ets-domain 303829 HMG14 and HMG17 303829 ENV polyprotein (coat polyprotein) 303831 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 303831 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 303832 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 303834 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 303836 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 303844 Ribosomal L10 303848 Mab-21 protein 303859 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in 303860 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 303861 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 303861 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 303862 Ribosomal protein S26e 303872 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 303874 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 303876 WD domain, G-beta repeat 303877 Zinc finger, C3HC4 type (RING finger) 303878 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 303879 Domain of unknown function 303882 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 303887 von Willebrand factor type D domain 303887 AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges 303888 Oxysterol-binding protein 303888 PH domain. PH stands for pleckstrin homology 303889 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 303889 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 303891 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 303892 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 303904 Zinc finger, C3HC4 type (RING finger) 303905 Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi 303907 WD domain, G-beta repeat 303908 RanBP1 domain 303909 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 303911 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 303912 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 303916 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 303918 TFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits TFIIE-alpha and TFIIE-beta and joins t 303919 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 303923 Ribosomal protein L21e 303923 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 303924 Tetraspanin family 303930 Ribosomal protein L6 303935 Ribosomal L39 protein 303952 TRAF-type zinc finger 303952 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 303953 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 303955 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 303956 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 303959 Core histone H2A/H2B/H3/H4 303961 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 303965 Myosin head (motor domain) 303965 Intermediate filament protein 303965 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 303965 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 303965 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 303965 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 303972 HMG (high mobility group) box 303977 CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and 303979 Hydratase/decarboxylase. This family consist of various hydratases and 4-oxalocrotonate decarboxylases which are involved in the bacterial meta-cleavage pathways for degradation of aromatic compounds. 2-hydroxypentadienoic acid hydratase encoded by mhpD i 303990 Homeobox domain 303991 Homeobox domain 303992 Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation 303993 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 303993 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 304005 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 304008 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 304010 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 304012 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 304013 Fibronectin type III domain 304014 7 transmembrane receptor (Secretin family) 304014 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 304015 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 304015 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 304017 TPR Domain 304021 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 304023 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 304024 Coproporphyrinogen III oxidase 304026 7 transmembrane receptor (rhodopsin family) 304028 Protein kinase domain 304029 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 304031 NOL1/NOP2/sun family 304033 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 304034 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 304035 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 304037 ADP-ribosylation factor family 304038 Syntaxin 304039 ADP-ribosylation factor family 304039 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 304039 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 304040 Aminotransferase class-III 304041 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304042 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304044 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304045 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304046 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304047 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304048 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304052 Ornithine decarboxylase antizyme 304053 Protein kinase domain 304053 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 304056 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 304061 WD domain, G-beta repeat 304067 ribosomal L5P family C-terminus. This region is found associated with pfam00281 304068 Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps 304070 Biotin protein ligase C terminal domain. The function of this structural domain is unknown. It is found to the C terminus of the biotin protein ligase catalytic domain pfam01317 304070 Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for 304071 Helix-loop-helix DNA-binding domain 304071 PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold 304071 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 304073 PMP-22/EMP/MP20/Claudin family 304075 TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s 304078 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 304079 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 304084 C2 domain 304085 Ribosomal protein S6 304085 Pentapeptide repeats (8 copies). These repeats are found in many cyanobacterial proteins. The repeats were first identified in hglK. The function of these repeats is unknown. The structure of this repeat has been predicted to be a beta-helix. The repeat c 304085 Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int 304087 Sodium:solute symporter family 304088 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 304089 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 304092 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 304092 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 304093 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 304102 Galactose binding lectin domain 304103 Helix-loop-helix DNA-binding domain 304109 Raf-like Ras-binding domain 304109 PH domain. PH stands for pleckstrin homology 304109 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 304112 7 transmembrane receptor (rhodopsin family) 304116 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 304124 PMP-22/EMP/MP20/Claudin family 304125 PMP-22/EMP/MP20/Claudin family 304127 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 304127 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 304135 Thrombospondin type 1 domain 304135 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 304135 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 304143 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 304150 Ubiquitin carboxyl-terminal hydrolase family 2 304168 Eukaryotic initiation factor 4E 304174 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 304181 Protein kinase domain 304182 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304183 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 304185 ATP synthase alpha/beta chain, C terminal domain 304185 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 304191 Helix-loop-helix DNA-binding domain 304192 Ribosomal protein S7e 304193 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 304196 RanBP1 domain 304197 Lectin C-type domain. This family includes both long and short form C-type 304199 Lectin C-type domain. This family includes both long and short form C-type 304200 Lectin C-type domain. This family includes both long and short form C-type 304201 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 304208 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 304209 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 304210 7 transmembrane receptor (metabotropic glutamate family) 304211 'chromo' (CHRromatin Organization MOdifier) domain 304212 'chromo' (CHRromatin Organization MOdifier) domain 304215 START domain 304215 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 304216 ENV polyprotein (coat polyprotein) 304224 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 304228 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 304229 7 transmembrane receptor (metabotropic glutamate family) 304229 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 304231 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304232 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304238 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 304239 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 304240 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 304242 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304243 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 304244 'chromo' (CHRromatin Organization MOdifier) domain 304255 Ribosomal protein S19e 304264 Ribosomal L18ae protein family 304265 Protein kinase domain 304265 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 304266 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 304266 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 304266 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 304267 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 304267 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 304268 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 304270 Protein kinase domain 304271 Zinc finger, C3HC4 type (RING finger) 304273 ENV polyprotein (coat polyprotein) 304273 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 304273 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 304273 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 304274 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 304275 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 304278 PPR repeat. This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilisation. Thi 304279 Ribosomal protein S8 304280 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 304285 PH domain. PH stands for pleckstrin homology 304287 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 304287 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 304289 Lipase (class 3) 304290 ER lumen protein retaining receptor 304291 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 304293 Transposase. Transposase proteins are necessary for efficient DNA transposition. Contains transposases for IS204, IS1001, IS1096 and IS1165 304293 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304293 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 304297 Fibronectin type III domain 304299 DIL domain. The DIL domain has no known function 304299 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 304300 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 304301 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 304301 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 304305 Luciferase-like monooxygenase 304305 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 304307 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 304307 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 304310 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 304310 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 304314 Uncharacterized ACR, COG1579 304314 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 304314 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 304319 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 304322 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 304323 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple 304325 Domain of unknown function 304328 7 transmembrane receptor (rhodopsin family) 304333 Cyclic nucleotide-binding domain 304336 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304337 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304338 Ribosomal protein S8e 304340 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304340 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 304341 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 304341 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304341 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 304342 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 304342 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 304343 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 304344 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 304346 Metallo-beta-lactamase superfamily 304347 7 transmembrane receptor (metabotropic glutamate family) 304347 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 304348 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 304358 O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine 304361 O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine 304368 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 304369 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 304370 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 304371 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 304372 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 304373 F-box domain 304374 F-box domain 304375 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 304378 GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein 304379 MAM domain. An extracellular domain found in many receptors 304380 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 304385 Clathrin adaptor complex small chain 304387 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 304388 PMP-22/EMP/MP20/Claudin family 304396 WD domain, G-beta repeat 304399 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 304401 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 304402 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 304403 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 304406 ubiE/COQ5 methyltransferase family 304407 PMP-22/EMP/MP20/Claudin family 304423 Flavodoxin 304429 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 304436 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 304438 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 304445 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 304448 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 304461 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 304466 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 304467 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 304468 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 304468 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 304472 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons 304477 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 304479 Helix-loop-helix DNA-binding domain 304486 Prismane/CO dehydrogenase family. This family includes both hybrid-cluster proteins and the beta chain of carbon monoxide dehydrogenase. The hybrid-cluster proteins contain two Fe/S centers - a [4Fe-4S] cubane cluster, and a hybrid [4Fe-2S-2O] cluster. Th 304494 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 304495 F-box domain 304495 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 304497 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 304497 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 304500 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 304502 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 304503 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 304506 Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates 304510 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells 304512 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 304514 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 304521 Zinc finger, C3HC4 type (RING finger) 304524 Protein kinase domain 304525 Protein kinase domain 304527 Acyl CoA binding protein 304528 Rad17 cell cycle checkpoint protein 304528 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 304529 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 304531 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 304532 Protein kinase domain 304539 Sir2 family 304540 Cytochrome c oxidase subunit VIa 304541 Glu-tRNAGln amidotransferase C subunit. This is a family of Glu-tRNAGln amidotransferase C subunits. The Glu-tRNA Gln amidotransferase enzyme itself is an important translational fidelity mechanism replacing incorrectly charged Glu-tRNAGln with the correc 304542 ubiE/COQ5 methyltransferase family 304545 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 304546 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 304547 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 304550 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 304554 CUB domain 304554 Sushi domain (SCR repeat) 304560 TPR Domain 304567 tRNA pseudouridine synthase. Involved in the formation of pseudouridine at the anticodon stem and loop of transfer-RNAs Pseudouridine is an isomer of uridine (5-(beta-D-ribofuranosyl) uracil, and id the most abundant modified nucleoside found in all cellu 304568 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 304569 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 304569 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 304570 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 304570 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 304571 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 304573 Ribosomal L10 304573 DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold 304573 DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities 304574 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304575 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 304576 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304576 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 304577 Uracil DNA glycosylase superfamily 304579 Ubiquitin carboxyl-terminal hydrolase family 2 304579 Ubiquitin carboxyl-terminal hydrolases family 2 304579 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f 304580 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 304582 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 304588 Synaptobrevin 304590 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 304590 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 304591 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 304593 Ribosomal protein S6e 304599 DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits 304601 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 304601 DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors) 304601 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 304601 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has 304605 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 304608 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 304608 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 304608 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 304610 7 transmembrane receptor (rhodopsin family) 304612 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 304614 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 304615 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304616 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 304617 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304619 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 304620 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304622 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 304624 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 304626 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304627 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304628 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304630 Ribosomal protein L21e 304632 7 transmembrane receptor (rhodopsin family) 304633 7 transmembrane receptor (rhodopsin family) 304634 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 304638 Ribosomal L22e protein family 304639 S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure 304642 Zinc finger, C3HC4 type (RING finger) 304645 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 304645 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 304646 Calreticulin family 304649 Sterile alpha motif (SAM)/Pointed domain 304649 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 304657 Ribonucleotide reductase, small chain 304658 Ribonucleotide reductase, small chain 304663 Helix-loop-helix DNA-binding domain 304667 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch 304670 Importin-beta N-terminal domain 304671 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 304672 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 304673 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 304676 S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 304676 S-adenosylmethionine synthetase, C-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold 304687 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 304688 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 304689 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 304690 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 304692 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 304693 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 304694 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 304697 Cadherin domain 304697 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 304698 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 304699 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 304701 Cadherin domain 304705 Protein kinase domain 304705 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 304707 ENV polyprotein (coat polyprotein) 304709 Myosin head (motor domain) 304709 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 304709 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 304710 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 304711 Ribosomal protein L13 304712 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 304715 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 304715 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 304716 Protein kinase domain 304717 Glutamine amidotransferase class-I 304718 ERCC4 domain. This domain is predicted to be a nuclease domain 304720 Pyridoxal-dependent decarboxylase conserved domain 304721 Pyridoxal-dependent decarboxylase conserved domain 304725 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 304727 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 304733 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 304734 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 304735 ENV polyprotein (coat polyprotein) 304737 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304739 Hsp90 protein 304742 Homeobox domain 304749 Prolyl oligopeptidase family 304749 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 304754 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 304754 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 304756 Ribosomal protein S5, N-terminal domain 304756 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 304756 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 304760 HNH endonuclease 304761 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 304761 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 304765 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 304766 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 304769 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 304769 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 304770 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 304775 Protein kinase domain 304776 Protein kinase domain 304778 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 304778 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 304779 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 304783 ADP-ribosylation factor family 304783 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 304783 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 304784 Sulfate transporter family. Mutations may lead to several human diseases 304784 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 304784 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 304786 Ets-domain 304791 Protein kinase domain 304794 Protein kinase domain 304798 p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding dom 304801 pfam02892, zf-BED, BED zinc finger 304805 Oxidoreductase FAD-binding domain 304805 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 304808 Ribosomal protein S7e 304812 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 304813 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 304815 Ets-domain 304815 Sterile alpha motif (SAM)/Pointed domain 304816 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 304817 Importin-beta N-terminal domain 304818 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 304822 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 304823 Fibronectin type III domain 304825 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 304829 Ribosomal protein L6 304831 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 304838 Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs 304838 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 304846 Ribosomal protein L23 304847 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 304849 ENV polyprotein (coat polyprotein) 304850 Zinc finger, C3HC4 type (RING finger) 304851 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 304851 N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The e 304852 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 304853 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 304855 Ribosomal protein S5, C-terminal domain 304858 Laminin N-terminal (Domain VI) 304860 Dihydrodipicolinate synthetase family. This family has a TIM barrel structure 304868 Ribosomal L29e protein family 304871 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 304871 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 304872 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 304873 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 304877 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 304878 Homeobox domain 304878 LIM domain. This family represents two copies of the LIM structural domain 304879 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 304880 LIM domain. This family represents two copies of the LIM structural domain 304886 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 304888 Fibrinogen beta and gamma chains, C-terminal globular domain 304889 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 304891 SH2 domain 304892 Lectin C-type domain. This family includes both long and short form C-type 304893 C2 domain 304893 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 304893 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 304894 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 304897 ATP synthase subunit C 304899 Ribosomal protein S5, C-terminal domain 304905 Zinc finger, C3HC4 type (RING finger) 304913 Fibronectin type III domain 304913 Fibrinogen beta and gamma chains, C-terminal globular domain 304914 Phosphotyrosine interaction domain (PTB/PID) 304915 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 304916 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 304917 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 304918 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 304918 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 304919 pfam02938, GAD, GAD domain. This domain is found in some members of the GatB and aspartyl tRNA synthetases 304919 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 304919 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 304919 OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata 304920 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 304920 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 304922 Hsp90 protein 304922 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 304922 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 304922 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 304935 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 304936 TIP41-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 interacts with TAP42 and negatively regulates the TOR signaling pathway 304938 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 304941 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 304941 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 304944 Phosphoribulokinase / Uridine kinase family 304947 Homeobox domain 304947 PBX domain. The PBX domain is a bipartite acidic domain 304950 PBX domain. The PBX domain is a bipartite acidic domain 304951 Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein ha 304954 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid 304956 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 304957 SH2 domain 304958 SH2 domain 304960 Olfactomedin-like domain 304963 HSF-type DNA-binding 304963 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 304964 C-5 cytosine-specific DNA methylase 304971 Eukaryotic porin 304972 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 304973 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 304974 Macrophage migration inhibitory factor (MIF) 304978 WD domain, G-beta repeat 304978 Coatomer WD associated domain 304980 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 304981 Ribosomal protein S2 304983 Calponin family repeat 304983 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 304985 7 transmembrane receptor (rhodopsin family) 304987 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 304987 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 304988 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 304988 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 304989 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 304989 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 304990 7 transmembrane receptor (rhodopsin family) 304991 7 transmembrane receptor (rhodopsin family) 304993 Actin 304998 7 transmembrane receptor (rhodopsin family) 304999 Repeat in ubiquitin-activating (UBA) protein 304999 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 305000 XPG I-region 305000 XPG N-terminal domain 305008 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 305009 NAC domain 305013 Ribosomal protein L31e 305019 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 305023 Fatty acid desaturase 305025 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 305025 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 305025 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 305031 BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1 305034 MIT domain 305034 Protein kinase domain 305034 PX domain. PX domains bind to phosphoinositides 305035 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 305039 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 305045 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 305047 Ribosomal protein L21e 305050 Ribosomal protein L31e 305055 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 305057 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 305058 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 305063 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 305064 Protein-tyrosine phosphatase 305064 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 305071 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 305072 14-3-3 protein 305073 WD domain, G-beta repeat 305074 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 305078 Laminin N-terminal (Domain VI) 305078 Giardia variant-specific surface protein 305078 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 305078 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 305083 Kinesin motor domain 305085 ubiE/COQ5 methyltransferase family 305087 Ribosomal protein L6 305091 Protein kinase domain 305092 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 305096 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 305099 HMG (high mobility group) box 305104 Thrombospondin N-terminal -like domain 305104 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 305106 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 305107 7 transmembrane receptor (metabotropic glutamate family) 305108 Ribosomal protein L13 305109 7 transmembrane receptor (metabotropic glutamate family) 305109 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 305111 7 transmembrane receptor (metabotropic glutamate family) 305111 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 305111 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 305112 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 305120 Domain of Unknown Function (DUF408) 305123 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 305124 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 305124 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 305138 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 305138 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 305139 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 305139 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 305139 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 305142 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 305149 NUDIX domain 305150 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 305161 ENV polyprotein (coat polyprotein) 305162 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 305162 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 305166 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 305167 UbiA prenyltransferase family 305169 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 305171 Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in 305178 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 305180 Ribosomal protein L21e 305181 Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold 305181 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 305181 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 305185 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 305188 Ribosomal S3Ae family 305194 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 305194 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 305195 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 305195 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 305198 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 305202 Ribosomal protein S19e 305203 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 305211 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 305216 ENV polyprotein (coat polyprotein) 305217 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 305224 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 305227 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 305232 Ribosomal protein L11, RNA binding domain 305232 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 305233 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 305234 Starch binding domain 305235 NAD:arginine ADP-ribosyltransferase 305236 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 305236 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 305237 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 305239 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 305240 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 305240 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 305241 Protein kinase domain 305241 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 305246 Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in 305246 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 305246 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 305248 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 305248 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 305251 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 305253 Thrombospondin type 1 domain 305253 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 305256 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 305258 Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown 305259 MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) 305262 Cyclophilin type peptidyl-prolyl cis-trans isomerase 305263 Hsp90 protein 305264 UDP-glucoronosyl and UDP-glucosyl transferase 305264 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 305265 Trypsin 305266 Ribosomal protein L23 305267 Trypsin 305268 Repeat in ubiquitin-activating (UBA) protein 305268 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 305269 PH domain. PH stands for pleckstrin homology 305272 Actin 305273 Ribosomal S17 305274 Cytochrome C and Quinol oxidase polypeptide I 305275 Ribosomal protein L6 305276 Ribosomal protein S19e 305277 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero 305279 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 305280 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 305282 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 305284 NAC domain 305290 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 305291 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero 305293 Homeobox domain 305302 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 305309 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa 305311 SH2 domain 305311 Protein kinase domain 305311 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 305314 E1-E2 ATPase 305317 Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold 305317 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 305317 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 305317 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 305317 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 305325 E1-E2 ATPase 305331 Homeobox domain 305332 LIM domain. This family represents two copies of the LIM structural domain 305338 Phosphotyrosine interaction domain (PTB/PID) 305338 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 305340 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 305341 ADP-ribosylation factor family 305341 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 305342 Smr domain. This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 protein in the case of Smr proteins and with the N-terminal MutS relate 305344 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 305348 Radical SAM superfamily 305350 Mitochondrial carrier protein 305351 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 305351 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 305353 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 305353 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 305354 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 305354 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 305355 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 305358 Ribosomal protein L24e 305358 Ribosomal L29e protein family 305365 Mitochondrial carrier protein 305367 PH domain. PH stands for pleckstrin homology 305367 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 305367 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 305367 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 305371 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 305373 HMG (high mobility group) box 305375 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 305384 7 transmembrane receptor (rhodopsin family) 305390 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 305392 Chromosome condensation protein 3, C-terminal region. Cnd3 is a Member of the five subunit condensin complex. Each subunit is essential for mitotic condensation 305399 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 305400 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 305400 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 305400 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 305405 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 305408 7 transmembrane receptor (Secretin family) 305408 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 305408 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 305417 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 305417 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, b 305422 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 305423 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 305424 F-box domain 305426 Sugar (and other) transporter 305427 Protein of unknown function, DUF270 305427 Protein of unknown function, DUF270 305428 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 305428 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 305431 Protein kinase domain 305434 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 305434 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 305438 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 305440 Villin headpiece domain 305441 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 305445 Homeobox domain 305449 Sugar (and other) transporter 305450 SH2 domain 305450 PH domain. PH stands for pleckstrin homology 305451 Ribosomal protein L21e 305454 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 305456 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 305456 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 305456 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 305462 Proteasome A-type and B-type 305464 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 305466 Phosphatidylserine decarboxylase. This is a family of phosphatidylserine decarboxylases, EC:4.1.1.65. These enzymes catalyse the reaction: Phosphatidyl-L-serine <=> phosphatidylethanolamine + CO2. Phosphatidylserine decarboxylase plays a central role in t 305470 GTP1/OBG family 305470 GTPase of unknown function 305470 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 305470 TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae 305471 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 305472 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 305475 Oxysterol-binding protein 305477 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 305478 Zinc finger, C3HC4 type (RING finger) 305479 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 305479 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 305481 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 305482 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 305484 CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2 305487 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 305491 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 305492 Ribosomal protein S24e 305494 Zinc carboxypeptidase 305494 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 305500 SH2 domain 305500 PH domain. PH stands for pleckstrin homology 305502 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 305506 eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues 305509 Adenylate and Guanylate cyclase catalytic domain 305510 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 305514 Ribosomal protein S8 305517 Ribosomal protein L6 305522 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 305539 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 305540 Sodium:dicarboxylate symporter family 305541 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 305545 ENV polyprotein (coat polyprotein) 305547 Galactoside-binding lectin 305550 Ribosomal protein L21e 305556 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 305571 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 305574 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 305578 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 305581 Peroxin 13, N-terminal domain. Both termini of the Peroxin-13 are oriented to the cytosol. Peroxin-13 is required for peroxisomal association of peroxin-14 305581 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 305584 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 305584 Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has 305586 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 305605 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 305605 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 305606 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 305606 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 305610 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes. 305613 PH domain. PH stands for pleckstrin homology 305615 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 305615 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 305619 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 305621 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 305622 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 305622 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 305622 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 305622 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 305626 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 305627 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 305627 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 305629 7 transmembrane receptor (metabotropic glutamate family) 305630 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 305630 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 305631 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 305632 Ribosomal S3Ae family 305635 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 305639 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 305640 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 305641 Myosin head (motor domain) 305642 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 305644 WD domain, G-beta repeat 305668 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 305669 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 305670 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 305675 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 305676 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 305679 Vinculin family 305681 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 305683 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 305684 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 305685 ubiE/COQ5 methyltransferase family 305685 O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production 305692 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 305692 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 305694 Glutaredoxin 305696 Glutaredoxin 305705 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 305706 ENV polyprotein (coat polyprotein) 305709 Transcription factor S-II (TFIIS) 305710 Protein kinase domain 305711 7 transmembrane receptor (rhodopsin family) 305712 Enolase, N-terminal domain 305712 Enolase, C-terminal TIM barrel domain 305722 Eukaryotic-type carbonic anhydrase 305733 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 305734 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 305735 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 305750 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 305750 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 305751 ADP-ribosylation factor family 305751 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 305755 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 305756 Enolase, C-terminal TIM barrel domain 305760 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 305761 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 305764 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 305764 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 305766 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 305766 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 305767 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 305767 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 305767 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 305772 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 305775 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 305776 ENV polyprotein (coat polyprotein) 305777 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 305778 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 305779 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 305781 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 305787 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 305792 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 305793 Protein kinase domain 305793 PX domain. PX domains bind to phosphoinositides 305794 Uncharacterized protein PaaI, COG2050 305794 MaoC like domain. The MaoC protein is found to share similarity with a wide variety of enzymes 305795 Lipase 305795 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 305795 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to 305797 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 305798 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 305799 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 305799 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 305800 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 305800 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 305801 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 305805 7 transmembrane receptor (rhodopsin family) 305806 Glutaredoxin 305811 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 305811 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL 305813 Polyprenyl synthetase 305814 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 305816 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera 305820 Core histone H2A/H2B/H3/H4 305824 Enolase, N-terminal domain 305824 Enolase, C-terminal TIM barrel domain 305826 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 305827 WD domain, G-beta repeat 305827 HMG (high mobility group) box 305828 SH2 domain 305830 F-box domain 305839 Ribosomal protein L23 305839 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 305840 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 305841 7 transmembrane receptor (rhodopsin family) 305841 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 305842 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 305843 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 305843 ATP synthase (F/14-kDa) subunit. This family includes 14-kDa subunit from vATPases, which is in the peripheral catalytic part of the complex. The family also includes archaebacterial ATP synthase subunit F 305844 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 305846 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 305850 C2 domain 305851 metallopeptidase family M24 305853 ADP-ribosylation factor family 305853 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 305854 7 transmembrane receptor (rhodopsin family) 305858 Homeobox domain 305858 Otx1 transcription factor 305860 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 305861 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 305863 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 305866 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 305867 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 305868 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 305869 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 305870 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 305873 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 305874 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 305875 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 305880 CUB domain 305886 Sugar (and other) transporter 305886 Sugar (and other) transporter 305888 Homeobox domain 305889 Adenylate cyclase. One member was found to complement mutants of E. coli. cya. This protein has been shown to be an adenylate kinase 305892 Ribosomal S17 305896 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt 305898 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 305899 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 305902 Trypsin 305903 Trypsin 305904 eRF1 domain 1. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 305904 eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 305904 eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 305906 Trypsin 305908 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 305910 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 305915 Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus 305916 Ribosomal protein L23 305917 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 305922 Protein kinase domain 305923 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 305930 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 305931 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 305932 Cytidine and deoxycytidylate deaminase zinc-binding region 305934 Ribosomal protein L23 305938 PH domain. PH stands for pleckstrin homology 305938 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 305938 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 305938 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 305938 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 305943 ADP-ribosylation factor family 305943 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 305947 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 305950 Ribosomal protein L23 305951 WD domain, G-beta repeat 305951 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 305956 WD domain, G-beta repeat 305956 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 305957 Formamidopyrimidine-DNA glycosylase 305958 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 305964 Peptide methionine sulfoxide reductase. This enzyme repairs damaged proteins. Methionine sulfoxide in proteins is reduced to methionine 305967 Kinesin motor domain 305967 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 305968 Homeobox domain 305974 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 305977 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 305979 Ribosomal L29e protein family 305980 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 305985 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 305985 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 305986 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 305986 Pentapeptide repeats (8 copies). These repeats are found in many cyanobacterial proteins. The repeats were first identified in hglK. The function of these repeats is unknown. The structure of this repeat has been predicted to be a beta-helix. The repeat c 305986 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 305988 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 305990 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 305993 ENV polyprotein (coat polyprotein) 305993 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 305996 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 305998 Ribosomal S17 305999 Homeobox domain 306000 Mitochondrial carrier protein 306001 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 306004 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 306004 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 306005 Phosphatidylethanolamine-binding protein 306006 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 306008 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 306009 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 306009 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 306010 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 306011 Ribosomal L10 306011 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 306011 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 306013 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 306013 EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, b 306014 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 306016 Ribosomal protein L31e 306020 Ribosomal protein L21e 306022 Fibronectin type III domain 306026 Mitochondrial carrier protein 306038 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 306044 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 306044 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 306055 Cadherin domain 306066 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 306071 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 306071 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 306072 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 306073 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 306073 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 306079 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 306081 Cadherin domain 306085 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 306091 Cadherin domain 306094 ENV polyprotein (coat polyprotein) 306097 HMG14 and HMG17 306100 Elongation factor 1 gamma, conserved domain 306100 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 306103 RNB-like protein. The function of this region of similarity is uncertain 306104 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 306104 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 306114 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 306115 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 306115 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 306117 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 306118 Phosphotyrosine interaction domain (PTB/PID) 306126 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 306127 MmgE/PrpD family. This family includes 2-methylcitrate dehydratase EC:4.2.1.79 (PrpD) that is required for propionate catabolism. It catalyses the third step of the 2-methylcitric acid cycle 306129 F-box domain 306138 Ribosomal protein S15 306138 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 306147 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 306152 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 306157 Glypican 306160 Ribosomal L10 306163 Glypican 306164 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 306164 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 306164 Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI) 306164 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 306166 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 306166 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 306168 HMG (high mobility group) box 306169 PMP-22/EMP/MP20/Claudin family 306179 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 306181 Ribosomal protein L21e 306183 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 306184 PH domain. PH stands for pleckstrin homology 306184 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 306185 Protein kinase domain 306186 PH domain. PH stands for pleckstrin homology 306188 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 306190 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 306191 7 transmembrane receptor (rhodopsin family) 306192 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 306193 7 transmembrane receptor (rhodopsin family) 306194 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 306195 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 306196 Endomembrane protein 70 306197 Endomembrane protein 70 306201 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 306203 PX domain. PX domains bind to phosphoinositides 306204 Filamin/ABP280 repeat 306205 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 306206 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 306207 Ribosomal L29e protein family 306208 Protein kinase domain 306210 Glutaredoxin 306212 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 306214 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 306222 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 306223 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 306226 Homeobox domain 306231 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 306232 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 306243 Cache domain 306244 Cache domain 306248 Ribosomal protein L34e 306249 Polyprenyl synthetase 306252 Protein kinase domain 306253 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 306257 Ubiquitin carboxyl-terminal hydrolase, family 1 306259 Ribosomal protein S24e 306260 MIT domain 306260 Calpain family cysteine protease 306260 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by 306262 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 306264 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 306268 QXW lectin repeat 306268 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 306270 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 306270 Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous 306271 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 306274 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 306274 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 306279 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 306280 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 306280 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 306281 Protein-tyrosine phosphatase 306281 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 306283 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 306285 C2 domain 306285 C2 domain 306288 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 306291 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 306292 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 306307 7 transmembrane receptor (rhodopsin family) 306312 Ribosomal protein L6e 306324 Tetraspanin family 306328 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 306328 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 306330 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 306333 SH2 domain 306334 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306335 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306338 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 306347 Occludin/ELL family 306351 Transforming growth factor beta like domain 306353 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 306357 7 transmembrane receptor (rhodopsin family) 306357 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 306357 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 306358 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306359 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306360 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306361 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306363 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306364 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306376 SH2 domain 306384 PH domain. PH stands for pleckstrin homology 306388 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 306398 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 306399 Zinc carboxypeptidase 306400 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 306402 HMG (high mobility group) box 306403 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 306404 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 306405 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 306406 Ribosomal S3Ae family 306409 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 306409 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 306412 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 306415 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 306417 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 306419 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 306421 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 306421 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 306421 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 306421 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 306423 Cyclophilin type peptidyl-prolyl cis-trans isomerase 306425 Eukaryotic ribosomal protein L18 306428 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 306431 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 306431 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 306432 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 306436 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 306439 Myelin proteolipid protein (PLP or lipophilin) 306439 Myelin proteolipid protein (PLP or lipophilin) 306441 WD domain, G-beta repeat 306441 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 306444 Poly-adenylate binding protein, unique domain 306444 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 306447 Ribosomal L10 306451 Macrophage migration inhibitory factor (MIF) 306454 PMP-22/EMP/MP20/Claudin family 306456 Uncharacterized ACR, COG1565 306456 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 306460 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 306462 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 306467 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 306467 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 306471 PX domain. PX domains bind to phosphoinositides 306471 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 306474 Ribosomal protein S26e 306476 Ribosomal L29e protein family 306477 Domain of unknown function UPF0099. This domain has no known function 306487 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 306496 Macrophage scavenger receptor 306496 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 306500 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 306504 ubiE/COQ5 methyltransferase family 306505 Core histone H2A/H2B/H3/H4 306505 Zinc finger, C3HC4 type (RING finger) 306505 ATP-dependent protease La (LON) domain 306506 Protein kinase domain 306509 Ribosomal protein L21e 306511 Ribosomal protein L6 306512 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 306515 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 306516 TFIIE beta subunit core domain. General transcription factor TFIIE consists of two subunits, TFIIE alpha pfam02002 and TFIIE beta. TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription i 306527 Protein-tyrosine phosphatase 306527 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 306535 Nucleoside diphosphate kinase 306540 Aminotransferase class I and II 306541 C2 domain 306544 Uncharacterized protein family UPF0001 306544 Prismane/CO dehydrogenase family. This family includes both hybrid-cluster proteins and the beta chain of carbon monoxide dehydrogenase. The hybrid-cluster proteins contain two Fe/S centers - a [4Fe-4S] cubane cluster, and a hybrid [4Fe-2S-2O] cluster. Th 306548 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 306549 Protein kinase domain 306551 Calcium-activated BK potassium channel alpha subunit 306551 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 306552 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 306554 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera 306555 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 306562 Intermediate filament protein 306562 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 306562 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 306562 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 306564 Trypsin 306564 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 306567 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 306571 MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases 306571 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 306574 Mitochondrial carrier protein 306576 Protein kinase domain 306577 Protein kinase domain 306579 Mammalian defensin 306579 Defensin propeptide 306581 NAC domain 306582 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 306583 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 306586 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 306589 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 306592 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 306594 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 306597 HMG (high mobility group) box 306600 E1-E2 ATPase 306605 C2 domain 306606 PH domain. PH stands for pleckstrin homology 306606 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 306606 BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region 306607 Uncharacterized ACR, COG1579 306608 Trypsin 306608 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 306608 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 306609 CUB domain 306613 CUB domain 306613 Sushi domain (SCR repeat) 306624 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 306626 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 306627 ADP-ribosylation factor family 306627 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 306630 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 306636 Ephrin 306640 Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase 306643 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 306655 Homeobox domain 306657 Homeobox domain 306659 Homeobox domain 306663 Thrombospondin type 1 domain 306663 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 306666 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 306667 ENV polyprotein (coat polyprotein) 306669 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 306672 PAP/25A associated domain 306672 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 306674 Cyclophilin type peptidyl-prolyl cis-trans isomerase 306675 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306677 ENV polyprotein (coat polyprotein) 306678 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306679 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306680 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306681 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306682 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 306683 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 306685 Trypsin 306690 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 306692 Ribosomal protein L31e 306694 Protein-tyrosine phosphatase 306694 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 306698 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 306698 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 306698 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 306705 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306706 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306708 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306709 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306710 Ribosomal protein L24e 306711 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306714 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306715 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306717 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306722 Death domain 306729 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 306736 Kinesin motor domain 306739 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 306749 Mitochondrial carrier protein 306750 F-box domain 306759 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 306760 PTB domain (IRS-1 type) 306761 Trypsin 306761 Fibronectin type II domain 306761 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 306761 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 306762 Profilin 306764 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 306764 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 306764 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 306767 ADP-ribosylation factor family 306771 Tetraspanin family 306782 Protein kinase domain 306782 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 306782 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 306782 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 306786 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 306790 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 306790 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 306792 Actin 306792 7 transmembrane receptor (rhodopsin family) 306792 C. elegans Sre G protein-coupled chemoreceptor. Caenorhabditis elegans Sre proteins are candidate chemosensory receptors. There are four main recognized groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre (this family), Sra pfam02117 and Srb pfam0217 306795 ENV polyprotein (coat polyprotein) 306798 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 306800 Transcription initiation factor IIF, beta subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIF (TFIIF) is a tetramer of two beta subun 306802 Protein kinase domain 306803 Protein kinase domain 306803 HMG (high mobility group) box 306804 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 306806 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 306807 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 306811 Protein kinase domain 306812 Kinesin motor domain 306812 Protein kinase C terminal domain 306814 jmjC domain 306816 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306818 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 306819 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 306825 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 306827 Core histone H2A/H2B/H3/H4 306840 Sir2 family 306840 NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide 306842 Translation initiation factor SUI1 306842 Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD 306842 pfam02894, GFO_IDH_MocA_C, Oxidoreductase family, C-terminal alpha/beta domain. This family of enzymes utilise NADP or NAD. This family is called the GFO/IDH/MOCA family 306859 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 306860 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 306861 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 306862 Transcription factor AP-2 306869 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 306870 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 306872 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 306873 Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element 306878 Ferredoxin-fold anticodon binding domain. This is the anticodon binding domain found in some phenylalanyl tRNA synthetases. The domain has a ferredoxin fold 306879 tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain 306885 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 306886 Death domain 306886 Protein kinase domain 306886 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 306889 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 306890 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 306891 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 306892 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 306894 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 306895 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 306896 Protein kinase domain 306900 Fork head domain 306913 Uncharacterized protein family UPF0004. This family is the N terminal half of the family. The C-terminal half has been shown to be related to MiaB proteins 306929 Somatotropin hormone family 306930 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 306936 Ribosomal protein S19e 306945 Core histone H2A/H2B/H3/H4 306945 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 306947 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 306948 Core histone H2A/H2B/H3/H4 306948 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 306949 Core histone H2A/H2B/H3/H4 306954 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 306955 Core histone H2A/H2B/H3/H4 306956 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 306957 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 306958 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 306961 WD domain, G-beta repeat 306962 Core histone H2A/H2B/H3/H4 306963 Core histone H2A/H2B/H3/H4 306964 Core histone H2A/H2B/H3/H4 306964 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 306966 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306967 Core histone H2A/H2B/H3/H4 306967 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 306968 Core histone H2A/H2B/H3/H4 306969 Core histone H2A/H2B/H3/H4 306969 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 306970 Core histone H2A/H2B/H3/H4 306971 Core histone H2A/H2B/H3/H4 306972 Core histone H2A/H2B/H3/H4 306973 Core histone H2A/H2B/H3/H4 306974 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 306974 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306974 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 306975 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 306977 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 306977 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 306977 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 306978 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 306979 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 306982 Intermediate filament protein 306990 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 306993 Homeobox domain 306993 Pou domain - N-terminal to homeobox domain 306997 Ribosomal protein L21e 306998 Protein kinase domain 307006 Ribosomal protein S6e 307007 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 307010 WD domain, G-beta repeat 307011 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 307011 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 307013 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 307016 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 307016 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 307016 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 307023 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 307028 ENV polyprotein (coat polyprotein) 307035 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 307035 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 307036 Guanylate kinase 307036 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 307036 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 307039 ADP-ribosylation factor family 307039 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 307043 Protein kinase domain 307047 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 307049 Ribosomal protein L15 307050 PAP/25A associated domain 307051 RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown 307051 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 307052 Ribosomal protein L36e 307056 PH domain. PH stands for pleckstrin homology 307056 Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined 307057 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 307058 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 307059 Ribosomal protein L6 307067 AMP-binding enzyme 307068 AMP-binding enzyme 307073 NUDIX domain 307074 NUDIX domain 307075 WD domain, G-beta repeat 307079 Ribosomal protein L21e 307080 Chaperonin 10 Kd subunit 307081 Insulinase (Peptidase family M16) 307082 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307084 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 307091 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 307092 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 307093 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 307095 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 307096 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 307097 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307098 PH domain. PH stands for pleckstrin homology 307098 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 307099 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 307100 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 307101 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 307102 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 307106 Sterile alpha motif (SAM)/Pointed domain 307106 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 307107 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 307108 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 307109 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 307114 eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues 307123 Protein kinase domain 307123 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 307124 Protein kinase domain 307132 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 307133 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 307134 ENV polyprotein (coat polyprotein) 307134 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307135 Ribosomal protein L34e 307136 Acyl CoA binding protein 307137 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 307144 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 307148 NOL1/NOP2/sun family 307148 Conserved hypothetical protein 95 307148 ubiE/COQ5 methyltransferase family 307148 FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple 307151 Zinc finger, C3HC4 type (RING finger) 307151 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 307154 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307156 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 307157 SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad 307159 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 307167 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 307167 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 307168 Protein kinase domain 307169 Protein kinase domain 307170 Acyl CoA binding protein 307171 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 307173 Ribosomal protein S19e 307177 Shikimate kinase 307177 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 307177 Pyridoxal-phosphate dependent enzyme. Members of this family are all pyridoxal-phosphate dependent enzymes. This family includes: serine dehydratase EC:4.2.1.13 P20132, threonine dehydratase EC:4.2.1.16, tryptophan synthase beta chain EC:4.2.1.20, threoni 307178 PH domain. PH stands for pleckstrin homology 307178 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 307178 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 307180 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 307189 Nebulin repeat 307189 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 307192 Actin 307192 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 307196 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 307200 SH2 domain 307202 Core histone H2A/H2B/H3/H4 307203 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 307204 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 307204 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 307205 CUB domain 307206 Enolase, N-terminal domain 307206 Enolase, C-terminal TIM barrel domain 307209 F-box domain 307212 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 307221 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 307234 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 307234 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 307235 Ribosome-binding factor A 307237 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 307238 SKIP/SNW domain. This domain is found in chromatin proteins 307239 HMG (high mobility group) box 307248 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 307248 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 307251 Zinc finger, C3HC4 type (RING finger) 307253 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 307255 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 307263 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 307263 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 307266 Protein kinase domain 307267 Protein kinase domain 307269 Ribosomal protein L31e 307270 Malic enzyme, N-terminal domain 307270 Malic enzyme, NAD binding domain 307288 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 307289 Eukaryotic porin 307298 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 307298 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 307304 Macrophage migration inhibitory factor (MIF) 307309 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 307314 Ribosomal protein S5, N-terminal domain 307315 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 307317 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307317 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 307318 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 307319 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 307323 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307326 Ribosomal protein L19e 307327 Ribosomal protein L19e 307327 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 307328 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 307329 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 307333 Ribosomal protein L19e 307334 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nit 307343 Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown 307350 Peptidase family M41 307350 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 307351 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 307351 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 307352 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 307353 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 307365 Ribosomal protein L13 307366 Death domain 307370 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b 307371 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 307376 Homeobox domain 307376 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 307384 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 307391 RasGAP C-terminus 307393 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 307394 LIM domain. This family represents two copies of the LIM structural domain 307395 Villin headpiece domain 307395 LIM domain. This family represents two copies of the LIM structural domain 307398 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 307398 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 307398 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 307400 3'5'-cyclic nucleotide phosphodiesterase 307401 3'5'-cyclic nucleotide phosphodiesterase 307401 GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases 307402 HMG (high mobility group) box 307403 Protein kinase domain 307403 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 307404 Sulfatase 307410 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307415 GTPase of unknown function 307415 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 307416 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 307420 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 307421 Isochorismatase family. This family are hydrolase enzymes 307427 WD domain, G-beta repeat 307431 HMG (high mobility group) box 307432 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307433 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307438 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 307440 Ribosomal L29e protein family 307441 HMG (high mobility group) box 307444 Translation initiation factor SUI1 307444 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 307444 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 307448 Skp1 family, dimerisation domain 307450 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 307451 High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin 307451 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 307453 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 307459 PH domain. PH stands for pleckstrin homology 307465 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 307465 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 307471 Zinc finger, C3HC4 type (RING finger) 307472 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 307474 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 307474 FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions 307475 7 transmembrane receptor (rhodopsin family) 307475 7 transmembrane receptor (rhodopsin family) 307476 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 307476 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 307481 Cadherin domain 307482 PH domain. PH stands for pleckstrin homology 307482 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 307482 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 307482 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 307482 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 307482 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 307484 Ribosomal protein L21e 307486 Cadherin domain 307487 Cadherin domain 307488 Cadherin domain 307489 Cadherin domain 307490 Cadherin domain 307491 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 307491 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 307492 WHEP-TRS domain 307492 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 307492 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 307496 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 307500 PH domain. PH stands for pleckstrin homology 307500 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 307503 eRF1 domain 1. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 307503 eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 307503 eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 307505 Vinculin family 307506 Vinculin family 307508 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 307511 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 307512 SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. The 307522 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 307523 Myosin head (motor domain) 307523 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 307524 WD domain, G-beta repeat 307525 RNA polymerase Rpb4 307530 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 307540 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307554 Ribosomal protein S27 307556 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 307557 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 307561 Zinc finger, C3HC4 type (RING finger) 307562 Cadherin domain 307563 Cadherin domain 307563 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 307582 Laminin B (Domain IV) 307582 Intermediate filament protein 307582 Laminin N-terminal (Domain VI) 307582 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 307582 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 307582 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 307585 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 307587 Mitochondrial carrier protein 307594 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 307602 Elongation factor 1 gamma, conserved domain 307602 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 307603 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 307607 Ribosomal protein L19e 307607 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 307611 Synaptogyrin. This family of proteins is distantly related to pfam01284 307612 Synaptogyrin. This family of proteins is distantly related to pfam01284 307614 Cadherin domain 307618 Cadherin domain 307618 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 307623 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 307623 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307631 Cadherin domain 307631 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 307641 Protein kinase domain 307650 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 307653 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 307660 Carboxylesterase 307662 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 307665 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 307667 HMG (high mobility group) box 307668 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 307669 Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses 307672 Myosin head (motor domain) 307674 Myosin head (motor domain) 307675 Myosin head (motor domain) 307684 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307688 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 307689 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307690 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 307690 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307694 Myosin head (motor domain) 307696 Myosin head (motor domain) 307700 Myosin head (motor domain) 307707 Myosin head (motor domain) 307710 Carboxylesterase 307711 Ribosomal protein S26e 307711 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 307713 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 307715 Homeobox domain 307724 C2 domain 307727 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 307727 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 307727 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 307731 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 307731 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 307738 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 307738 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 307740 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 307742 PX domain. PX domains bind to phosphoinositides 307744 Adenylate and Guanylate cyclase catalytic domain 307745 PAP/25A associated domain 307745 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 307745 Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to 307749 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 307750 Ribosomal protein L19e 307751 Ribosomal protein S8 307752 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 307757 CUB domain 307758 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307759 Aminotransferase class I and II 307760 Ribosomal protein S17 307761 Protein kinase domain 307762 Ribosomal protein S19 307763 Ribosomal L29e protein family 307763 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 307764 Ubiquitin carboxyl-terminal hydrolase family 2 307764 Ubiquitin carboxyl-terminal hydrolases family 2 307767 Calx-beta domain 307771 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 307772 Anaphase-promoting complex, subunit 10 (APC10) 307772 Poxvirus D5 protein. This protein is necessary for viral DNA replication, and is a nucleic acid independent nucleoside triphosphatase 307775 Ribosomal protein S26e 307778 Ribosomal L28e protein family 307779 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307782 Clathrin adaptor complex small chain 307784 Domain of unknown function 307790 Mitochondrial carrier protein 307791 Carboxylesterase 307794 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 307796 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 307797 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 307799 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 307801 Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD 307802 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307803 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 307805 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 307807 Ribosomal protein L23 307808 Protein kinase domain 307809 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour 307810 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307810 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour 307811 Amino acid permease 307812 Conserved hypothetical protein 95 307815 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 307821 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 307829 Homeobox domain 307833 Eukaryotic protein of unknown function, DUF292 307836 PH domain. PH stands for pleckstrin homology 307836 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 307839 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 307849 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 307849 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 307850 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 307851 SCP-like extracellular protein. This domain is also found in prokaryotes 307851 Lectin C-type domain. This family includes both long and short form C-type 307852 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 307852 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 307853 Fatty acid hydroxylase 307853 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 307859 Sulfotransferase protein 307860 Adenosine-deaminase (editase) domain 307862 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 307868 SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs 307872 Lectin C-type domain. This family includes both long and short form C-type 307873 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 307882 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 307893 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 307893 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 307897 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 307897 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 307900 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 307900 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 307902 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 307903 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 307903 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 307905 Ubiquitin carboxyl-terminal hydrolase family 2 307905 Ubiquitin carboxyl-terminal hydrolases family 2 307915 Uncharacterised protein family (UPF0172) 307916 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 307920 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307922 AMP-binding enzyme 307923 Vacuolar sorting protein 9 (VPS9) domain 307925 Ribosomal protein L19e 307927 WD domain, G-beta repeat 307935 B-box zinc finger 307938 Fibronectin type III domain 307938 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 307947 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 307948 Protein of unknown function, DUF265 307950 Protein kinase domain 307950 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 307952 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 307956 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 307960 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 307968 C2 domain 307968 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 307973 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 307979 HMG (high mobility group) box 307982 Nebulin repeat 307983 Nebulin repeat 307983 LIM domain. This family represents two copies of the LIM structural domain 307988 von Willebrand factor type A domain 307988 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 307989 Villin headpiece domain 307989 LIM domain. This family represents two copies of the LIM structural domain 307990 TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s 307992 Delta serrate ligand 307992 Lectin C-type domain. This family includes both long and short form C-type 307992 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 307997 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 307999 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 308000 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 308000 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 308001 Homeobox domain 308003 C2 domain 308005 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 308007 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 308011 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308012 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308013 Carboxylesterase 308017 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 308021 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 308026 Protein kinase domain 308028 Sugar (and other) transporter 308032 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308036 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 308036 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 308037 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 308039 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 308044 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 308045 HIT family 308045 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 308047 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 308048 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 308051 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 308052 Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase 308053 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 308061 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 308062 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 308064 Intermediate filament protein 308066 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 308067 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 308074 Protein kinase domain 308076 Protein kinase domain 308078 Protein kinase domain 308078 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 308079 Protein kinase domain 308080 Protein kinase domain 308081 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 308086 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 308097 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 308099 Fibronectin type III domain 308100 Intermediate filament protein 308100 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 308100 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 308100 Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains. 308100 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 308101 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 308101 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 308102 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 308102 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 308103 7 transmembrane receptor (rhodopsin family) 308106 Protein kinase domain 308108 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL 308109 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 308113 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 308113 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 308118 ATP synthase 308118 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 308119 Peptide hormone. This family contains glucagon, GIP, secretin and VIP 308121 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 308126 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 308127 Formate--tetrahydrofolate ligase 308127 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 308127 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 308127 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 308129 Nitroreductase family. Members of this family utilise FMN as a cofactor 308136 ADP-ribosylation factor family 308140 Ribosomal RNA adenine dimethylase 308141 PMP-22/EMP/MP20/Claudin family 308142 PH domain. PH stands for pleckstrin homology 308142 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 308142 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 308163 7 transmembrane receptor (rhodopsin family) 308164 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 308170 Protein kinase domain 308172 Protein kinase domain 308173 Kinesin motor domain 308174 CAP protein 308176 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 308177 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 308178 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 308182 Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family 308183 Protein kinase domain 308183 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 308184 Protein kinase domain 308185 Protein kinase domain 308185 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 308186 Protein kinase domain 308186 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 308188 Protein kinase domain 308190 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 308191 Exocyst complex subunit Sec15-like 308191 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 308191 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 308193 Protein kinase domain 308194 Protein kinase domain 308195 Protein kinase domain 308196 Protein kinase domain 308197 Protein kinase domain 308197 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 308198 Protein kinase domain 308199 Protein kinase domain 308199 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 308200 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 308200 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 308201 RIO1/ZK632.3/MJ0444 family 308203 Protein kinase domain 308203 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 308204 Protein kinase domain 308205 Protein kinase domain 308205 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 308209 Kinesin motor domain 308209 Protein kinase domain 308209 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 308210 HMG (high mobility group) box 308210 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 308211 Protein kinase domain 308213 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 308213 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 308214 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 308224 WD domain, G-beta repeat 308231 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308232 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308233 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308234 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308236 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308237 Ribosomal L18ae protein family 308238 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308239 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308239 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308240 Protein kinase domain 308240 7 transmembrane receptor (metabotropic glutamate family) 308240 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308241 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308242 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308243 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308244 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308245 7 transmembrane receptor (metabotropic glutamate family) 308245 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308246 7 transmembrane receptor (metabotropic glutamate family) 308246 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308247 Protein kinase domain 308248 7 transmembrane receptor (metabotropic glutamate family) 308248 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308250 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308251 7 transmembrane receptor (metabotropic glutamate family) 308251 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308252 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308253 7 transmembrane receptor (metabotropic glutamate family) 308253 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308254 ENV polyprotein (coat polyprotein) 308254 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 308254 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308254 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 308254 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308255 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308256 7 transmembrane receptor (metabotropic glutamate family) 308256 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308257 E1-E2 ATPase 308257 7 transmembrane receptor (metabotropic glutamate family) 308257 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308257 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 308257 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308263 Ribosomal protein L23 308265 Protein kinase domain 308265 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 308267 Uncharacterized ACR, COG1579 308269 Fibrillarin 308282 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 308282 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 308282 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 308282 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 308283 F-box domain 308286 Mitochondrial carrier protein 308287 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 308287 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 308287 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 308287 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308290 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 308290 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308291 Low temperature viability protein 308294 Cytidine and deoxycytidylate deaminase zinc-binding region 308296 7 transmembrane receptor (metabotropic glutamate family) 308297 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308297 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308298 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308299 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308300 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308301 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 308301 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 308302 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc 308303 7 transmembrane receptor (metabotropic glutamate family) 308303 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308304 7 transmembrane receptor (metabotropic glutamate family) 308304 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308305 7 transmembrane receptor (metabotropic glutamate family) 308305 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308306 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 308306 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 308307 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308308 7 transmembrane receptor (metabotropic glutamate family) 308309 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 308311 Not1 N-terminal domain, CCR4-Not complex component 308311 NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5 308312 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 308313 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 308313 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 308314 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 308315 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 308316 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308318 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308320 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308322 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308324 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308325 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308326 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308330 7 transmembrane receptor (rhodopsin family) 308330 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 308330 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 308330 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 308331 Homeobox domain 308332 Protein kinase domain 308333 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 308334 7 transmembrane receptor (metabotropic glutamate family) 308334 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308335 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 308336 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308340 Protein kinase domain 308341 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 308341 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308344 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 308347 Protein kinase domain 308349 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 308353 Ribosomal protein L44 308354 7 transmembrane receptor (rhodopsin family) 308354 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 308355 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 308355 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 308355 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308356 Ribosomal protein S21e 308356 7 transmembrane receptor (metabotropic glutamate family) 308358 7 transmembrane receptor (metabotropic glutamate family) 308358 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308359 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 308362 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 308362 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308362 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308363 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308363 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308364 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308365 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 308365 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308365 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308367 Ribosomal protein L6 308368 Ribosomal protein S19e 308372 HMG (high mobility group) box 308376 7 transmembrane receptor (Secretin family) 308376 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 308377 Connexin 308378 7 transmembrane receptor (rhodopsin family) 308379 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 308380 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 308383 ENV polyprotein (coat polyprotein) 308384 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 308386 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308388 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 308389 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 308392 WD domain, G-beta repeat 308397 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 308400 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 308401 WD domain, G-beta repeat 308402 Class I Histocompatibility antigen, domains alpha 1 and 2 308402 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 308405 Protein kinase domain 308407 HMG (high mobility group) box 308408 7 transmembrane receptor (rhodopsin family) 308410 Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues 308420 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 308420 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308421 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308422 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308423 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308424 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308424 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308429 Phospholipase A2 inhibitor 308431 PH domain. PH stands for pleckstrin homology 308435 HMG (high mobility group) box 308440 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 308441 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 308444 Protein kinase domain 308444 Fibronectin type III domain 308444 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 308445 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 308446 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 308446 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 308449 Protein kinase domain 308451 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 308451 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 308454 Phosphotyrosine interaction domain (PTB/PID) 308454 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 308455 PH domain. PH stands for pleckstrin homology 308455 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 308457 2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2 308457 2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2 308458 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 308458 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 308459 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 308461 Protein kinase domain 308463 Protein kinase domain 308463 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 308465 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308466 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308467 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308468 Protein kinase domain 308472 KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG 308473 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 308476 Calpain family cysteine protease 308476 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by 308478 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 308482 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308483 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 308483 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 308483 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308484 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 308485 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308486 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308487 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308487 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308488 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308489 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308490 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 308490 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308490 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 308491 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308492 WD domain, G-beta repeat 308492 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308493 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 308493 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 308495 Fibronectin type III domain 308495 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 308495 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 308496 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 308497 Amyloid A4 extracellular domain 308498 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 308498 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 308501 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 308503 Glycosyltransferase family 6 308503 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 308505 Hsp90 protein 308506 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 308507 LIM domain. This family represents two copies of the LIM structural domain 308508 Repeat in ubiquitin-activating (UBA) protein 308508 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 308510 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 308511 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 308525 Ribosomal protein L15 308532 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308533 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308534 SH2 domain 308535 ENV polyprotein (coat polyprotein) 308535 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308536 Phosphoglycerate kinase 308537 Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b 308540 Ribosomal protein L13 308541 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308542 PH domain. PH stands for pleckstrin homology 308542 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 308543 PH domain. PH stands for pleckstrin homology 308543 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 308550 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 308550 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308551 7 transmembrane receptor (metabotropic glutamate family) 308551 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308553 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 308554 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308559 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308560 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308561 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308561 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308562 Trypsin 308564 Trypsin 308565 Trypsin 308566 Trypsin 308569 Histidine acid phosphatase 308569 7 transmembrane receptor (rhodopsin family) 308570 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308571 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 308575 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 308578 Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site 308578 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 308578 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 308579 Choline/Carnitine o-acyltransferase 308582 TEA/ATTS domain family 308583 Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD 308583 pfam02894, GFO_IDH_MocA_C, Oxidoreductase family, C-terminal alpha/beta domain. This family of enzymes utilise NADP or NAD. This family is called the GFO/IDH/MOCA family 308584 PH domain. PH stands for pleckstrin homology 308585 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 308586 Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes 308587 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 308587 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 308588 Eukaryotic-type carbonic anhydrase 308588 Eukaryotic-type carbonic anhydrase 308589 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 308592 WD domain, G-beta repeat 308597 Serum amyloid A protein 308600 Homeobox domain 308600 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 308600 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 308601 Ribosomal L38e protein family 308601 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 308603 7 transmembrane receptor (rhodopsin family) 308605 7 transmembrane receptor (rhodopsin family) 308606 7 transmembrane receptor (rhodopsin family) 308607 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 308608 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 308623 Hsp90 protein 308623 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 308636 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 308638 Ribosomal L18ae protein family 308638 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 308638 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 308641 Growth-Arrest-Specific Protein 2 Domain 308645 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 308649 Protein kinase domain 308650 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 308652 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 308653 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 308653 TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle 308661 Mitochondrial carrier protein 308662 Ribosomal protein L34e 308662 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 308663 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 308665 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 308669 Regulator of chromosome condensation (RCC1) 308669 Anaphase-promoting complex, subunit 10 (APC10) 308669 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 308669 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 308669 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 308670 C-5 cytosine-specific DNA methylase 308678 Mitochondrial carrier protein 308690 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 308691 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 308694 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 308694 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 308700 tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases 308700 TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae 308701 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 308701 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 308702 Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be 308703 Protein kinase domain 308703 ADP-ribosylation factor family 308705 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 308706 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 308706 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 308709 Intermediate filament protein 308709 Intermediate filament protein 308710 Pyroglutamyl peptidase 308710 Cyclophilin type peptidyl-prolyl cis-trans isomerase 308711 Ribosomal protein S18 308713 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 308715 Tissue factor 308716 Tissue factor 308716 Ribosomal protein S15 308718 WD domain, G-beta repeat 308725 HMG (high mobility group) box 308727 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 308732 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 308738 'chromo' (CHRromatin Organization MOdifier) domain 308738 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 308738 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 308742 C2 domain 308744 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 308752 Ribosomal L10 308755 HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease 308757 Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism 308761 Microtubule associated protein (MAP65/ASE1 family) 308762 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 308765 Clathrin adaptor complex small chain 308766 Helix-loop-helix DNA-binding domain 308767 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 308770 Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in 308773 Extracellular link domain 308773 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 308776 3'5'-cyclic nucleotide phosphodiesterase 308777 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 308779 Fibronectin type III domain 308779 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 308787 Thrombospondin type 1 domain 308787 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 308792 START domain 308799 7 transmembrane receptor (metabotropic glutamate family) 308801 ADP-ribosylation factor family 308801 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 308802 Phosphoglycerate kinase 308807 Trypsin 308808 Malic enzyme, N-terminal domain 308811 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 308813 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 308815 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 308821 ADP-ribosylation factor family 308821 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 308821 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 308826 Ribosomal protein L21e 308833 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 308835 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 308835 ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum m 308836 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 308837 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 308839 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 308848 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 308850 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 308850 Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be 308850 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 308851 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 308853 7 transmembrane receptor (rhodopsin family) 308854 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 308855 Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for 308857 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 308857 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 308864 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 308865 WD domain, G-beta repeat 308865 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 308865 ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a 308865 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 308867 Ribosomal protein L11, RNA binding domain 308868 O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production 308872 NAD:arginine ADP-ribosyltransferase 308873 NAD:arginine ADP-ribosyltransferase 308875 ADP-ribosylation factor family 308875 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 308879 Poly A polymerase family. This family includes nucleic acid independent RNA polymerases, such as Poly(A) polymerase, which adds the poly (A) tail to mRNA EC:2.7.7.19. This family also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of 308880 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 308887 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 308887 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 308890 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 308896 Ubiquitin carboxyl-terminal hydrolase family 2 308896 Ubiquitin carboxyl-terminal hydrolases family 2 308898 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 308900 Zinc finger, C3HC4 type (RING finger) 308901 B-box zinc finger 308901 Zinc finger, C3HC4 type (RING finger) 308901 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 308902 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 308904 7 transmembrane receptor (rhodopsin family) 308905 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 308905 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 308906 B-box zinc finger 308906 Zinc finger, C3HC4 type (RING finger) 308906 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 308907 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 308909 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 308917 Olfactomedin-like domain 308919 Trypsin 308919 CUB domain 308919 Oxidoreductase FAD-binding domain 308919 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 308920 ENV polyprotein (coat polyprotein) 308922 Ribosomal L38e protein family 308923 WD domain, G-beta repeat 308925 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 308927 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 308928 HMG (high mobility group) box 308931 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 308932 Peptidase family C54 308933 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 308937 Protein kinase domain 308941 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 308941 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 308942 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 308942 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 308942 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 308942 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 308944 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 308944 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 308944 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 308945 Ribosomal protein L21e 308946 Protein kinase domain 308949 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 308950 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 308951 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 308955 Ribosomal protein L10 308956 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 308956 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 308959 Ubiquitin carboxyl-terminal hydrolase family 2 308965 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 308966 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 308968 Zinc finger, C3HC4 type (RING finger) 308968 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 308972 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 308976 jmjC domain 308985 Ribosomal Proteins L2, C-terminal domain 308985 Ribosomal Proteins L2, RNA binding domain 308988 CUB domain 308988 Sushi domain (SCR repeat) 308990 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 308993 AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt 308993 AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d 308995 von Willebrand factor type A domain 308995 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 308996 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308997 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 308998 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 308999 F-box domain 308999 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 309001 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 309001 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 309006 Ribosomal protein L21e 309007 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 309008 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 309010 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 309010 DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera 309014 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 309027 Zona pellucida-like domain 309027 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 309032 WD domain, G-beta repeat 309035 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 309036 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 309036 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 309038 AMP-binding enzyme 309039 ENV polyprotein (coat polyprotein) 309041 pfam02926, THUMP, THUMP domain. The THUMP domain is named after after thiouridine synthases, methylases and PSUSs. The THUMP domain consists of about 110 amino acid residues. It is predicted that this domain is an RNA-binding domain that adopts an alpha/b 309044 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 309047 HMG (high mobility group) box 309055 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 309056 Sulfotransferase protein 309063 Ribosomal protein S12 309064 ENV polyprotein (coat polyprotein) 309066 Ribosomal protein S21e 309070 Uroporphyrinogen-III synthase HemD. This family consists of uroporphyrinogen-III synthase HemD EC:4.2.1.75 also known as Hydroxymethylbilane hydrolyase (cyclizing) from eukaryotes, bacteria and archaea. This enzyme catalyses the reaction: Hydroxymethylbil 309071 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 309077 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 309081 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 309095 Homeobox domain 309097 7 transmembrane receptor (Secretin family) 309098 Disintegrin 309098 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 309098 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 309099 Ribosomal protein L6 309103 Glycosyl hydrolase family 65 central catalytic domain. This family of glycosyl hydrolases contains vacuolar acid trehalase and maltose phosphorylase.Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic phos 309106 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 309108 Intermediate filament tail domain 309110 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 309111 Mitochondrial carrier protein 309113 RNA polymerases N / 8 kDa subunit 309115 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc 309118 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 309126 Importin-beta N-terminal domain 309127 Cyclin-dependent kinase regulatory subunit 309129 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 309139 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 309140 Ribosomal prokaryotic L21 protein 309142 Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in 309146 ENV polyprotein (coat polyprotein) 309151 Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage 309153 CAAX amino terminal protease family. Members of this family are probably proteases. the family contains CAAX prenyl protease. The proteins contain a highly conserved Glu-Glu motif at the amino end of the alignment. The alignment also contains two histidin 309155 Microtubule associated protein (MAP65/ASE1 family) 309163 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 309169 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 309172 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 309173 Mitochondrial carrier protein 309174 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di 309174 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 309179 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy 309179 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 309185 Platelet-derived growth factor (PDGF) 309186 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 309191 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 309194 TATA box binding protein associated factor (TAF). TAF proteins adopt a histone-like fold 309195 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 309197 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 309198 Fibronectin type III domain 309201 Tetraspanin family 309202 Ribosomal protein L36e 309202 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 309203 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 309204 Ribosomal protein L36e 309207 Lipase (class 3) 309210 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 309212 7 transmembrane receptor (rhodopsin family) 309214 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 309217 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 309218 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 309219 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 309221 7 transmembrane receptor (rhodopsin family) 309222 7 transmembrane receptor (rhodopsin family) 309223 Ribosomal protein L15 309224 DNA directed RNA polymerase, 7 kDa subunit 309225 7 transmembrane receptor (rhodopsin family) 309230 7 transmembrane receptor (rhodopsin family) 309230 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 309231 7 transmembrane receptor (rhodopsin family) 309236 Ribosomal protein S2 309238 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 309240 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 309242 ADP-ribosylation factor family 309242 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 309246 Fork head domain 309248 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 309249 Ribosomal protein L36e 309250 HMG14 and HMG17 309251 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 309259 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 309261 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 309262 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 309262 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 309264 LIM domain. This family represents two copies of the LIM structural domain 309275 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 309276 ENV polyprotein (coat polyprotein) 309276 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 309279 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 309280 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 309280 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 309280 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 309281 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa 309283 Enolase, N-terminal domain 309283 Enolase, C-terminal TIM barrel domain 309288 metallopeptidase family M24 309288 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 309291 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 309292 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 309300 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 309301 RNA 3'-terminal phosphate cyclase 309306 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 309311 Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a 309312 Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a 309314 Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a 309315 Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a 309317 Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a 309318 Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a 309320 Core histone H2A/H2B/H3/H4 309321 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 309324 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 309325 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 309327 Eukaryotic initiation factor 3, gamma subunit. eIF-3 is a multisubunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3 309328 Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a 309329 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 309331 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 309331 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 309331 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 309334 Cyclic nucleotide-binding domain 309338 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 309345 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 309345 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 309348 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 309349 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 309356 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 309356 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 309358 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 309358 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 309359 Ubiquitin carboxyl-terminal hydrolase family 2 309360 Ubiquitin carboxyl-terminal hydrolase family 2 309360 Ubiquitin carboxyl-terminal hydrolases family 2 309361 Protein kinase domain 309362 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 309362 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 309362 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 309363 Ribosomal protein L19e 309368 Laminin N-terminal (Domain VI) 309368 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 309369 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 309373 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 309374 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 309376 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 309377 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 309381 Thi4 family. This family includes a putative thiamine biosynthetic enzyme 309381 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 309381 Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th 309382 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 309383 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m 309384 Protein kinase domain 309386 Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m 309389 Homeobox domain 309391 Cytochrome oxidase assembly protein. This is a family of integral membrane proteins. CtaA is required for cytochrome aa3 oxidase assembly in Bacillus subtilis. COX15 is required for cytochrome c oxidase assembly in yeast 309392 CutC family. Copper transport in Escherichia coli is mediated by the products of at least six genes, cutA, cutB, cutC, cutD, cutE, and cutF. A mutation in one or more of these genes results in an increased copper sensitivity. Members of this family are be 309393 NAC domain 309393 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 309396 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 309396 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 309399 Dienelactone hydrolase family 309399 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 309406 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 309408 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 309410 MAM domain. An extracellular domain found in many receptors 309416 Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain i 309419 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 309430 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 309432 Fatty acid desaturase 309433 WD domain, G-beta repeat 309434 jmjC domain 309437 'Paired box' domain 309441 DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the dup 309447 LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located in 309448 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 309449 GNS1/SUR4 family 309451 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 309451 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 309452 Death domain 309452 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 309457 Zinc finger, C3HC4 type (RING finger) 309458 S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure 309460 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 309462 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 309465 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 309465 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 309475 Endomembrane protein 70 309476 CUB domain 309482 Ribosomal protein L24e 309483 Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosyn 309494 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 309495 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 309499 C2 domain 309500 C2 domain 309501 C2 domain 309504 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 309508 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 309509 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 309511 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 309511 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 309512 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 309512 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 309512 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 309513 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 309514 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 309518 RNase P subunit p30. This protein is part of the RNase P complex that is involved in tRNA maturation 309522 tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyla 309523 Kinesin motor domain 309523 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 309523 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 309525 Ribosomal protein L44 309527 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M 309528 ab-hydrolase associated lipase region 309528 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 309529 Ribosomal family S4e 309532 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 309533 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 309541 Ribosomal Proteins L2, C-terminal domain 309541 Ribosomal Proteins L2, RNA binding domain 309543 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 309544 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 309550 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 309552 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 309558 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 309558 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 309562 S-adenosyl-L-homocysteine hydrolase 309562 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 309567 Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 309567 Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold 309574 7 transmembrane receptor (rhodopsin family) 309575 7 transmembrane receptor (rhodopsin family) 309576 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 309577 7 transmembrane receptor (rhodopsin family) 309578 7 transmembrane receptor (rhodopsin family) 309579 7 transmembrane receptor (rhodopsin family) 309583 7 transmembrane receptor (rhodopsin family) 309583 LIM domain. This family represents two copies of the LIM structural domain 309584 Class I Histocompatibility antigen, domains alpha 1 and 2 309584 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 309585 B-box zinc finger 309586 B-box zinc finger 309586 Zinc finger, C3HC4 type (RING finger) 309586 Class I Histocompatibility antigen, domains alpha 1 and 2 309586 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 309586 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 309587 Class I Histocompatibility antigen, domains alpha 1 and 2 309588 Class I Histocompatibility antigen, domains alpha 1 and 2 309588 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 309589 Class I Histocompatibility antigen, domains alpha 1 and 2 309590 Class I Histocompatibility antigen, domains alpha 1 and 2 309591 B-box zinc finger 309591 Zinc finger, C3HC4 type (RING finger) 309591 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 309592 Class I Histocompatibility antigen, domains alpha 1 and 2 309593 GTPase of unknown function 309595 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 309595 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 309596 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 309598 Class I Histocompatibility antigen, domains alpha 1 and 2 309598 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 309599 Class I Histocompatibility antigen, domains alpha 1 and 2 309599 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 309600 Class I Histocompatibility antigen, domains alpha 1 and 2 309601 TNF(Tumor Necrosis Factor) family 309603 Class I Histocompatibility antigen, domains alpha 1 and 2 309605 Class I Histocompatibility antigen, domains alpha 1 and 2 309606 Class I Histocompatibility antigen, domains alpha 1 and 2 309606 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 309607 Class I Histocompatibility antigen, domains alpha 1 and 2 309612 Ribosomal protein L13e 309613 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 309613 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 309615 TPR Domain 309618 Homeobox domain 309618 PBX domain. The PBX domain is a bipartite acidic domain 309619 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 309620 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 309621 Class II histocompatibility antigen, alpha domain 309621 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 309622 Class II histocompatibility antigen, beta domain 309622 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 309623 Class II histocompatibility antigen, beta domain 309625 Class II histocompatibility antigen, beta domain 309626 Zinc finger, C3HC4 type (RING finger) 309627 Class I Histocompatibility antigen, domains alpha 1 and 2 309629 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 309630 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 309630 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 309639 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 309640 Phosphotyrosine interaction domain (PTB/PID) 309640 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 309642 PH domain. PH stands for pleckstrin homology 309644 Tub family 309647 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 309647 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 309647 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 309649 Intermediate filament protein 309649 ADP-ribosylation factor family 309649 7 transmembrane receptor (rhodopsin family) 309649 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 309649 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 309649 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 309649 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 309650 C2 domain 309650 von Willebrand factor type A domain 309651 Cyclophilin type peptidyl-prolyl cis-trans isomerase 309653 PH domain. PH stands for pleckstrin homology 309653 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 309653 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 309657 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 309658 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 309665 Trypsin 309665 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 309666 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 309666 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 309667 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 309684 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 309688 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 309691 Uncharacterized protein family UPF0054 309694 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 309695 Sodium:solute symporter family 309697 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 309713 Ribosomal protein S26e 309715 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 309722 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 309722 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 309723 CTF/NF-I family 309728 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 309733 jmjC domain 309738 jmjC domain 309743 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 309745 Vinculin family 309747 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 309749 Vinculin family 309751 Vinculin family 309753 Vinculin family 309754 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 309757 Sir2 family 309758 Regulator of chromosome condensation (RCC1) 309758 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 309760 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 309763 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 309763 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 309766 Ribosomal protein L21e 309767 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 309771 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 309771 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 309773 CUB domain 309773 LCCL domain 309774 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 309778 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 309786 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 309788 TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme 309794 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 309796 Homeobox domain 309799 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 309800 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 309804 Protein kinase domain 309805 Protein kinase domain 309811 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 309812 DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold 309812 DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities 309821 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 309822 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 309828 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 309830 Homeobox domain 309831 Homeobox domain 309833 Core histone H2A/H2B/H3/H4 309834 LIM domain. This family represents two copies of the LIM structural domain 309837 Thrombospondin type 1 domain 309837 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 309837 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 309840 Ribosomal protein S7e 309846 AFG1-like ATPase. This family of proteins contains a P-loop motif and are predicted to be ATPases 309848 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 309848 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 309852 Cyclophilin type peptidyl-prolyl cis-trans isomerase 309854 WD domain, G-beta repeat 309855 SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin 309858 pfam02889, Sec63, Sec63 domain 309859 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 309868 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 309869 Autophagy protein Apg5. Apg5p is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway 309871 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 309871 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 309883 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 309884 Proteasome A-type and B-type 309887 pfam02889, Sec63, Sec63 domain 309887 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 309887 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 309891 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 309892 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 309893 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 309895 Class I Histocompatibility antigen, domains alpha 1 and 2 309896 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 309897 Protein kinase domain 309898 Protein kinase domain 309900 Tub family 309901 Fibronectin type III domain 309902 CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins 309904 Homeobox domain 309905 Homeobox domain 309906 Homeobox domain 309909 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 309911 Amidase 309912 Zinc-binding dehydrogenase 309915 ENV polyprotein (coat polyprotein) 309918 Occludin/ELL family 309919 2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HCC 309922 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 309922 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 309923 Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines 309927 Ribosomal protein L24e 309928 C2 domain 309940 Nucleoside diphosphate kinase 309941 Mitochondrial carrier protein 309946 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 309949 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 309957 SRF-type transcription factor (DNA-binding and dimerisation domain) 309959 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 309962 3'5'-cyclic nucleotide phosphodiesterase 309962 PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels 309969 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 309975 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 309977 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 309981 TMS membrane protein/tumour differentially expressed protein (TDE) 309985 DNA mismatch repair proteins, mutS family 309992 Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole 309998 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 309999 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 309999 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 310000 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 310004 Succinate dehydrogenase cytochrome b subunit 310005 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 310011 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 310011 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 310019 WD domain, G-beta repeat 310029 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 310030 Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction 310033 Occludin/ELL family 310034 Rad17 cell cycle checkpoint protein 310034 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 310053 Cyclophilin type peptidyl-prolyl cis-trans isomerase 310056 Importin-beta N-terminal domain 310058 Ribosomal protein L23 310071 WD domain, G-beta repeat 310074 GNS1/SUR4 family 310074 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 310086 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 310086 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 310086 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL 310090 Fibronectin type III domain 310090 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 310090 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 310091 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310091 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 310092 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310096 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 310097 PX domain. PX domains bind to phosphoinositides 310097 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 310098 Translation initiation factor SUI1 310106 ADP-ribosylation factor family 310126 Sushi domain (SCR repeat) 310126 Low-density lipoprotein receptor domain class A 310126 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl 310132 Fibronectin type III domain 310139 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 310141 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 310144 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310151 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 310160 RNase3 domain 310169 HMG (high mobility group) box 310173 Cadherin domain 310174 Cadherin domain 310174 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 310178 PH domain. PH stands for pleckstrin homology 310178 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 310179 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 310180 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 310182 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310183 Ribosomal protein L6 310192 PH domain. PH stands for pleckstrin homology 310192 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 310192 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 310192 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 310193 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 310194 MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases 310200 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 310201 Dienelactone hydrolase family 310209 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 310210 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 310212 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 310213 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 310214 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 310215 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 310216 Clathrin adaptor complex small chain 310216 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 310218 Eukaryotic-type carbonic anhydrase 310223 Ribosomal protein L34e 310228 Ribosomal protein L31e 310228 Eukaryotic protein of unknown function, DUF279 310230 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa 310236 HMG (high mobility group) box 310242 Ribosomal protein L34e 310242 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 310243 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 310245 Ribosomal protein L36e 310247 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 310250 Homeobox domain 310256 Eukaryotic initiation factor 4E 310256 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 310257 Glutathione peroxidase 310257 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 310261 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 310273 Fibronectin type III domain 310277 Carboxylesterase 310281 Carboxylesterase 310284 Ribosomal protein L6 310285 Ribosomal protein L6 310298 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 310302 Calcium-activated potassium channel, beta subunit 310303 Calcium-activated potassium channel, beta subunit 310305 Ubiquitin carboxyl-terminal hydrolase family 2 310306 Ubiquitin carboxyl-terminal hydrolases family 2 310306 Zn-finger in ubiquitin-hydrolases and other protein 310306 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 310312 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 310317 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 310318 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310322 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310324 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 310325 Core histone H2A/H2B/H3/H4 310325 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 310326 Sulfatase 310326 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 310326 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 310327 7 transmembrane receptor (rhodopsin family) 310334 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310334 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310339 ENV polyprotein (coat polyprotein) 310341 Cadherin domain 310341 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 310344 Protein kinase domain 310352 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 310353 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 310354 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 310358 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 310360 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 310360 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 310360 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 310361 Coenzyme A transferase 310363 F-box domain 310365 Ribosomal protein L21e 310366 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310366 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310369 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 310369 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 310375 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310376 Protein kinase domain 310378 Alanine dehydrogenase/pyridine nucleotide transhydrogenase. This family now also contains the lysine 2-oxoglutarate reductases 310383 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 310385 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 310388 Ribosomal protein L31e 310389 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 310390 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 310392 Amino acid permease 310395 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 310401 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 310410 Ribosomal protein L21e 310426 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 310427 WD domain, G-beta repeat 310429 MCM2/3/5 family 310433 Ribosomal protein L11, RNA binding domain 310433 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 310434 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310436 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 310437 Ribosomal protein S15 310437 TSC-22/dip/bun family 310440 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 310442 7 transmembrane receptor (rhodopsin family) 310443 7 transmembrane receptor (rhodopsin family) 310444 7 transmembrane receptor (rhodopsin family) 310448 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 310448 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 310449 Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 310450 7 transmembrane receptor (rhodopsin family) 310451 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 310454 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 310460 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 310461 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 310461 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 310462 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 310463 C2 domain 310463 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 310463 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 310464 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 310465 7 transmembrane receptor (metabotropic glutamate family) 310465 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 310467 WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems 310467 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 310471 PH domain. PH stands for pleckstrin homology 310480 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310480 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310487 7 transmembrane receptor (rhodopsin family) 310499 Ribosomal protein L21e 310506 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 310508 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 310509 ATP synthase alpha/beta chain, C terminal domain 310509 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 310509 pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 310510 Aminotransferase class-V 310511 WD domain, G-beta repeat 310514 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310514 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310515 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 310516 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310516 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310517 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 310519 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 310521 Enolase, N-terminal domain 310521 Enolase, C-terminal TIM barrel domain 310529 Eukaryotic porin 310531 Hsp90 protein 310531 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 310533 Cyclic nucleotide-binding domain 310533 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 310533 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 310533 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 310542 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 310545 HMG (high mobility group) box 310545 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310548 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310552 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 310552 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 310553 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 310553 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 310563 Polyprenyl synthetase 310565 Trypsin 310574 Actinobacillus constitutively-expressed outer membrane lipoprotein A 310575 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 310576 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 310579 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 310580 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 310585 Enolase, N-terminal domain 310585 Enolase, C-terminal TIM barrel domain 310585 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310585 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310587 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 310589 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 310589 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310590 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310591 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 310591 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310592 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 310592 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 310592 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 310592 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310593 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310594 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310595 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 310595 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310596 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 310596 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 310596 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310597 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 310597 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310598 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 310598 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310600 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310601 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 310601 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310601 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 310602 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 310603 Dihydrofolate reductase 310606 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 310606 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 310607 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 310609 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 310611 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 310611 N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere 310615 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 310615 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 310615 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 310617 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310617 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310625 Ammonium Transporter Family 310628 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 310629 ADP-ribosylation factor family 310629 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 310630 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 310630 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 310632 GTPase of unknown function 310632 ADP-ribosylation factor family 310632 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 310633 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 310635 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 310636 7 transmembrane receptor (rhodopsin family) 310637 ADP-ribosylation factor family 310637 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 310638 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 310639 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 310639 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 310641 B-box zinc finger 310641 Fibronectin type III domain 310641 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 310642 Ephrin 310643 Ephrin 310645 Phosphomevalonate kinase. Phosphomevalonate kinase (EC:2.7.4.2) catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This family represents the animal typ 310647 SH2 domain 310647 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 310648 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310648 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310650 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 310650 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 310650 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 310651 C2 domain 310651 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 310651 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 310652 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 310652 Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment 310653 Ribosomal protein L9, N-terminal domain 310654 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310657 HMG (high mobility group) box 310658 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 310659 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 310664 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 310664 DHH family. It is predicted that this family of proteins all perform a phosphoesterase function. It included the single stranded DNA exonuclease RecJ 310664 DHHA2 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA2 for DHH associated domain. This domain is diagnostic of DHH subfamily 2 members. The domain is about 120 residues long and contains a conserved DXK motif at 310666 Annexin. This family of annexins also includes giardin that has been shown to function as an annexin 310667 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 310668 Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has 310668 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 310672 Eukaryotic-type carbonic anhydrase 310672 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 310672 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 310673 Ribosomal protein L6 310674 PH domain. PH stands for pleckstrin homology 310675 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 310678 Core histone H2A/H2B/H3/H4 310679 Core histone H2A/H2B/H3/H4 310680 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 310683 von Willebrand factor type A domain 310683 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 310684 NUDIX domain 310688 SH2 domain 310691 Ets-domain 310693 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 310694 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 310697 Doublecortin 310698 Protein kinase domain 310700 Beige/BEACH domain 310701 Mab-21 protein 310706 Ribosomal L38e protein family 310707 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 310707 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 310710 Synaptobrevin 310713 Core histone H2A/H2B/H3/H4 310715 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 310715 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 310716 tRNA synthetases class I (W and Y) 310716 Ribosomal Proteins L2, C-terminal domain 310716 Ribosomal Proteins L2, RNA binding domain 310718 tRNA synthetases class I (W and Y) 310720 WD domain, G-beta repeat 310720 Dip2/Utp12 Family. This domain is found at the C-terminus of proteins containing WD40 repeats. These proteins are part of the U3 ribonucleoprotein the yeast protein is called Utp12 or DIP2 310724 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 310727 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 310728 Ribosomal protein L21e 310729 Cyclophilin type peptidyl-prolyl cis-trans isomerase 310732 Sugar (and other) transporter 310738 Nerve growth factor family 310740 'Cold-shock' DNA-binding domain 310741 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 310742 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 310743 Olfactomedin-like domain 310745 Metallo-beta-lactamase superfamily 310746 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 310747 Ribosomal protein L44 310753 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 310753 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 310756 UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t 310758 Ribosomal protein L34e 310758 F-actin capping protein alpha subunit 310760 Ribosomal protein L36e 310760 Uncharacterized ACR, COG1579 310760 Intermediate filament protein 310760 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 310760 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 310760 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 310760 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 310761 Ribosomal protein S12 310765 Ribosomal protein L21e 310769 WD domain, G-beta repeat 310771 Glycosyl hydrolases family 18 310772 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 310772 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 310772 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 310773 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 310778 Homeobox domain 310779 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 310779 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 310780 7 transmembrane receptor (rhodopsin family) 310781 Cytochrome b561 310781 Synaptophysin / synaptoporin 310781 Synaptogyrin. This family of proteins is distantly related to pfam01284 310782 Fibronectin type III domain 310784 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone 310786 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310786 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310787 Fibronectin type III domain 310787 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about 310791 Mitochondrial carrier protein 310791 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 310797 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 310797 Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str 310801 Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells 310803 Ribosomal protein L13e 310806 Protein-tyrosine phosphatase 310806 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 310807 Uncharacterized ACR, COG1579 310807 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 310807 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 310808 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 310810 Reeler domain 310810 DOMON domain. The DOMON (named after dopamine beta-monooxygenase N-terminal) domain is 110-125 residues long. It is predicted to form an all beta fold with 7-8 strands. The domain may mediate extracellular adhesive interactions 310811 Paralemmin 310812 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 310815 PX domain. PX domains bind to phosphoinositides 310819 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310820 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310825 Ribosomal protein S19e 310825 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 310826 HMG (high mobility group) box 310827 Cyclophilin type peptidyl-prolyl cis-trans isomerase 310833 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 310833 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 310838 SH2 domain 310838 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 310839 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 310839 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 310842 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 310845 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 310846 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 310848 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 310848 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti 310855 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 310856 Inorganic pyrophosphatase 310861 Ribosomal L10 310863 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 310864 Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 310864 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 310869 7 transmembrane receptor (rhodopsin family) 310870 ENV polyprotein (coat polyprotein) 310871 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 310877 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 310879 MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b 310886 Sulfatase 310888 Hsp90 protein 310891 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 310891 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 310893 Ribosomal protein S24e 310894 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 310894 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 310898 SH2 domain 310899 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 310899 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 310900 Thymosin beta-4 family 310900 Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitelli 310902 Zinc-binding dehydrogenase 310903 Zinc-binding dehydrogenase 310908 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 310909 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 310914 Protein kinase domain 310914 Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t 310917 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 310917 Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 310922 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310923 WD domain, G-beta repeat 310928 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 310932 LIM domain. This family represents two copies of the LIM structural domain 310933 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 310945 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 310954 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f 310955 Adenylate kinase 310956 Adenylate kinase 310957 Adenylate kinase 310958 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 310960 Ubiquitin carboxyl-terminal hydrolase family 2 310962 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 310964 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 310975 Ribosomal protein L21e 310982 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 310986 Protein kinase domain 310987 Protein kinase C terminal domain 310988 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 310991 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 310996 Helix-loop-helix DNA-binding domain 310997 7 transmembrane receptor (rhodopsin family) 310999 Death domain 310999 PH domain. PH stands for pleckstrin homology 310999 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 311003 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 311003 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311005 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 311005 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311006 ENV polyprotein (coat polyprotein) 311006 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 311006 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 311006 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311009 Zinc finger, C3HC4 type (RING finger) 311013 HIT family 311013 Sulfatase 311014 L1 transposable element 311014 Ribosomal protein L13e 311015 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 311019 Ribosomal L28e protein family 311022 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 311022 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 311028 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311030 VHS domain. Domain present in VPS-27, Hrs and STAM 311030 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 311038 Ribosomal protein S5, C-terminal domain 311038 Ribosomal protein S5, N-terminal domain 311039 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 311041 L1 transposable element 311043 ENV polyprotein (coat polyprotein) 311046 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311046 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 311049 Occludin/ELL family 311050 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311050 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 311055 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 311060 Fibronectin type II domain 311060 Lectin C-type domain. This family includes both long and short form C-type 311061 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 311065 Ribosomal protein S8 311080 Ribosomal protein L6 311082 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 311082 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 311091 Ribosomal protein L15 311091 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 311095 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311096 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 311105 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 311110 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 311111 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 311114 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 311114 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311115 Putative zinc finger in N-recognin 311120 Ribosomal L15 311121 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311121 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 311122 Mitochondrial carrier protein 311130 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 311144 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 311147 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 311151 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 311159 ENV polyprotein (coat polyprotein) 311164 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311166 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity 311167 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 311169 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 311171 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 311172 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 311173 7 transmembrane receptor (rhodopsin family) 311175 7 transmembrane receptor (rhodopsin family) 311176 7 transmembrane receptor (rhodopsin family) 311176 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 311179 7 transmembrane receptor (rhodopsin family) 311180 7 transmembrane receptor (rhodopsin family) 311183 Ribosomal protein S26e 311184 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 311185 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311187 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 311187 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 311190 Low molecular weight phosphotyrosine protein phosphatase 311193 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 311197 HMG (high mobility group) box 311202 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 311210 Tetraspanin family 311211 Tetraspanin family 311215 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mos 311218 Aminotransferase class I and II 311220 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 311228 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311236 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 311240 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311241 Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-phos 311245 TRAF-type zinc finger 311245 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 311245 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 311250 B-box zinc finger 311252 Trypsin 311252 CUB domain 311252 Sushi domain (SCR repeat) 311253 Ribosomal protein S8e 311254 e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit 311254 2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain 311255 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 311256 Ribosomal protein S11 311257 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 311258 Ribosomal protein S6e 311259 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 311260 LIM domain. This family represents two copies of the LIM structural domain 311265 Ribosomal protein S21e 311265 Ribosomal protein L21e 311271 Signal peptidase I 311276 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 311278 HIT family 311281 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311289 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 311295 Proteasome A-type and B-type 311297 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311307 Domain of unknown function DUF71. This family of proteins have no known function. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus.In some members of this family, this domain is associated with pfam01042 311322 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 311324 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 311326 RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to 311329 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 311330 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 311331 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 311332 Delta serrate ligand 311334 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin 311336 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 311337 Protein kinase domain 311341 C2 domain 311341 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 311342 C2 domain 311342 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 311342 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 311343 C2 domain 311343 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 311344 S25 ribosomal protein 311345 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 311352 Adenosine/AMP deaminase 311352 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 311353 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 311354 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 311358 TMS membrane protein/tumour differentially expressed protein (TDE) 311362 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 311368 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 311373 Zinc finger, C3HC4 type (RING finger) 311373 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 311375 SH2 domain 311382 Ribosomal protein S5, C-terminal domain 311384 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 311387 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells 311388 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311390 Ribosomal protein S11 311391 SH2 domain 311391 Intermediate filament protein 311391 Phosphotyrosine interaction domain (PTB/PID) 311391 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 311391 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 311396 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 311397 AMP-binding enzyme 311399 Ubiquitin carboxyl-terminal hydrolase family 2 311399 Ubiquitin carboxyl-terminal hydrolases family 2 311401 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 311401 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 311402 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 311404 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 311406 Protein-tyrosine phosphatase 311406 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 311416 Vinculin family 311418 Ribosomal L22e protein family 311419 Protein kinase domain 311421 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311422 Ham1 family. This family consists of the HAM1 protein and hypothetical archaeal bacterial and C. elegans proteins. HAM1 controls 6-N-hydroxylaminopurine (HAP) sensitivity and mutagenesis in S. cerevisiae. The HAM1 protein protects the cell from HAP, eithe 311424 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 311425 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 311425 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 311426 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 311427 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 311431 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 311437 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 311441 Eukaryotic initiation factor 3, gamma subunit. eIF-3 is a multisubunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3 311442 CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond 311443 Fibronectin type III domain 311446 FMN-dependent dehydrogenase 311446 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 311446 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 311446 Conserved region in glutamate synthase. This family represents a region of the glutamate synthase protein. This region is expressed as a separate subunit in the glutamate synthase alpha subunit from archaebacteria, or part of a large multidomain enzyme in 311450 Protein kinase domain 311451 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 311456 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 311462 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311463 ENV polyprotein (coat polyprotein) 311464 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 311468 Asparaginase 311469 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311469 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 311471 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 311474 Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi 311475 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311475 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 311477 CTF/NF-I family 311478 PX domain. PX domains bind to phosphoinositides 311480 Kinesin motor domain 311483 Spumavirus gag protein 311483 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 311483 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 311491 Papain family cysteine protease 311491 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 311498 Rap/ran-GAP 311503 Homeobox domain 311504 Enolase, N-terminal domain 311504 Enolase, C-terminal TIM barrel domain 311505 'Paired box' domain 311508 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311510 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311511 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 311512 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 311513 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 311514 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 311515 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 311517 Core histone H2A/H2B/H3/H4 311517 Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This 311521 Ribonuclease T2 family 311523 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311525 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 311526 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311527 Protein kinase domain 311527 Apoptosis regulator proteins, Bcl-2 family 311528 Uncharacterised protein family (UPF0080) 311529 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 311529 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 311530 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 311532 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 311538 HMG (high mobility group) box 311542 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 311544 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 311544 PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi 311545 Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225 311547 Fork head domain 311547 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 311547 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt 311548 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 311557 C-5 cytosine-specific DNA methylase 311557 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 311558 LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 311558 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 311560 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 311561 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 311564 Eukaryotic protein of unknown function, DUF279 311565 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311566 Ribosomal protein L11, RNA binding domain 311566 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 311567 C2 domain 311567 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 311569 AMP-binding enzyme 311570 Myosin head (motor domain) 311570 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 311570 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 311570 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 311571 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311571 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 311574 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311574 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 311575 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 311577 SH2 domain 311577 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 311577 Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to 311578 Intermediate filament protein 311578 Ribosomal protein S5, C-terminal domain 311578 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 311580 HD domain. HD domains are metal dependent phosphohydrolases 311582 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 311587 Uncharacterised protein family (UPF0041) 311604 Homeobox domain 311605 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311607 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 311607 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 311607 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 311609 Fibronectin type III domain 311612 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 311613 Zinc finger, C2HC type. This is a DNA binding zinc finger domain 311613 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 311614 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 311615 HMG (high mobility group) box 311616 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 311618 SCP-like extracellular protein. This domain is also found in prokaryotes 311621 TPR Domain 311622 Protein kinase domain 311626 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 311627 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 311628 Ribosomal protein L21e 311629 PX domain. PX domains bind to phosphoinositides 311639 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311641 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 311642 Sulfatase 311642 Protein of unknown function (DUF229). Members of this family are uncharacterised. They are 500-1200 amino acids in length and share a long region conservation that probably corresponds to several domains 311647 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 311647 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 311655 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 311655 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 311667 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 311667 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311670 WD domain, G-beta repeat 311671 Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase 311672 Syntaxin 311674 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 311674 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 311675 ADP-ribosylation factor family 311675 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 311678 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 311681 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311685 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311687 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311688 ENV polyprotein (coat polyprotein) 311688 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311689 Endothelin family 311691 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 311693 Cadherin domain 311701 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do 311715 Ribosomal protein L21e 311715 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 311716 Fibronectin type III domain 311716 von Willebrand factor type A domain 311716 Thrombospondin N-terminal -like domain 311716 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 311720 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 311721 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 311723 HMG (high mobility group) box 311724 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311726 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 311727 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311727 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 311730 Auxin Efflux Carrier 311730 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 311731 Peptidase family U34 311735 GTP1/OBG family 311735 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 311738 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 311740 Ribosomal L39 protein 311741 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 311745 Cyclic nucleotide-binding domain 311748 metallopeptidase family M24 311751 Adenylate kinase 311756 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 311758 Uncharacterized ACR, COG1579 311758 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 311758 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 311759 Thrombospondin type 1 domain 311761 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 311765 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 311767 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311768 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 311775 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 311776 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 311777 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 311781 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 311782 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 311783 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 311784 Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18 311785 PH domain. PH stands for pleckstrin homology 311785 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 311786 TRAF-type zinc finger 311787 TRAF-type zinc finger 311788 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 311789 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 311791 MYND finger 311792 GDA1/CD39 (nucleoside phosphatase) family 311793 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 311794 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 311795 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 311799 Flavodoxin 311799 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 311799 FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase 311802 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 311803 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 311807 Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invol 311809 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 311810 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 311811 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 311812 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 311816 Ribosomal Proteins L2, C-terminal domain 311816 Ribosomal Proteins L2, RNA binding domain 311821 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 311822 Ribosomal protein L6 311825 Protein kinase domain 311826 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 311827 Thrombospondin type 1 domain 311833 Adenylate kinase 311839 AMP-binding enzyme 311843 Uncharacterized ACR, COG2106 311844 Aminotransferase class I and II 311844 DegT/DnrJ/EryC1/StrS aminotransferase family. The members of this family are probably all pyridoxal-phosphate-dependent aminotransferase enzymes with a variety of molecular functions. The family includes StsA, StsC and StsS. The aminotransferase activity 311844 Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OAH 311845 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 311845 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 311853 ubiE/COQ5 methyltransferase family 311853 O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine 311856 Ubiquitin carboxyl-terminal hydrolase family 2 311856 Ubiquitin carboxyl-terminal hydrolases family 2 311856 Zn-finger in ubiquitin-hydrolases and other protein 311858 Arginosuccinate synthase. This family contains a PP-loop motif 311860 SH2 domain 311860 Protein kinase domain 311860 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 311861 Fibrinogen beta and gamma chains, C-terminal globular domain 311862 Laminin B (Domain IV) 311862 Intermediate filament protein 311862 Laminin N-terminal (Domain VI) 311862 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 311862 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 311862 ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a 311864 Phosphoribulokinase / Uridine kinase family 311868 Rap/ran-GAP 311870 PH domain. PH stands for pleckstrin homology 311872 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311876 Homeobox domain 311876 PBX domain. The PBX domain is a bipartite acidic domain 311878 PBX domain. The PBX domain is a bipartite acidic domain 311880 Vacuolar sorting protein 9 (VPS9) domain 311880 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 311881 WD domain, G-beta repeat 311882 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 311883 CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide vari 311885 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 311885 Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fo 311886 Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element 311889 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 311898 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 311901 Homeobox domain 311901 LIM domain. This family represents two copies of the LIM structural domain 311902 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 311903 Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round 311905 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 311906 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 311907 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 311909 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 311910 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 311910 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 311911 7 transmembrane receptor (rhodopsin family) 311911 Phosphotyrosine interaction domain (PTB/PID) 311914 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 311914 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 311914 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 311917 Protein kinase domain 311918 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 311919 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 311923 Cyclophilin type peptidyl-prolyl cis-trans isomerase 311923 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 311926 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 311934 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 311936 IQ calmodulin-binding motif. Calmodulin-binding motif 311942 Homeobox domain 311944 HMG (high mobility group) box 311944 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 311950 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 311952 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 311955 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 311956 WD domain, G-beta repeat 311957 NAD-dependent glycerol-3-phosphate dehydrogenase 311959 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 311965 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 311968 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 311968 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 311973 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 311976 Zinc-binding dehydrogenase 311977 ADP-ribosylation factor family 311977 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 311980 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 311996 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 312000 Myosin head (motor domain) 312003 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 312007 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 312008 Myosin head (motor domain) 312009 Myosin head (motor domain) 312011 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 312015 Myosin head (motor domain) 312020 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 312020 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 312021 Myosin head (motor domain) 312024 Myosin head (motor domain) 312032 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 312034 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 312035 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 312036 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 312049 Uncharacterized protein family UPF0036 312049 NADP oxidoreductase coenzyme F420-dependent 312051 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312052 NADP oxidoreductase coenzyme F420-dependent 312053 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 312053 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 312054 GTP1/OBG family 312054 GTPase of unknown function 312054 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 312054 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 312067 mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein 312068 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 312069 Protein kinase domain 312071 Protein kinase domain 312075 Protein kinase domain 312076 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312077 Intermediate filament protein 312078 Apoptosis regulator proteins, Bcl-2 family 312086 Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine 312086 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H 312096 Homeobox domain 312100 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 312102 HSF-type DNA-binding 312103 Hsp90 protein 312105 eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 312105 eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known. 312108 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 312113 START domain 312113 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 312115 von Willebrand factor type A domain 312115 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 312115 Ku70/Ku80 N-terminal alpha/beta domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the amino 312115 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 312120 Ribosomal family S4e 312127 Thrombospondin type 1 domain 312130 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 312136 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312140 Fork head domain 312142 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312148 wnt family 312149 wnt family 312150 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 312150 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 312152 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 312157 wnt family 312162 Ribosomal protein S5, C-terminal domain 312167 Zinc finger, C3HC4 type (RING finger) 312167 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 312168 Ribosomal protein L36e 312170 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 312171 Hyaluronidase 312174 Telomere-binding protein alpha subunit, N-terminal domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Al 312178 Core histone H2A/H2B/H3/H4 312179 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 312182 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312186 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 312189 Tetraspanin family 312191 Ribosomal protein L19e 312192 S-adenosyl-L-homocysteine hydrolase 312192 TrkA-N domain. This domain is found in a wide variety of proteins. These protein include potassium channels, phosphoesterases, and various other transporters. This domain binds to NAD 312192 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 312199 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 312201 Zinc carboxypeptidase 312201 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol 312202 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 312207 Translation initiation factor SUI1 312210 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 312210 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 312213 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 312217 HMG (high mobility group) box 312221 O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production 312222 Ribosomal protein L21e 312225 WD domain, G-beta repeat 312226 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict 312227 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312228 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 312240 HMG (high mobility group) box 312242 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 312243 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 312245 Sugar (and other) transporter 312248 Ribosomal protein L14p/L23e 312250 Protein of unknown function, DUF259 312251 Protein of unknown function, DUF259 312255 Sugar (and other) transporter 312256 ADP-ribosylation factor family 312256 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 312257 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 312257 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 312257 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 312258 ABC1 family. This family includes ABC1 from yeast and AarF from E. coli. These proteins have a nuclear or mitochondrial subcellular location in eukaryotes. The exact molecular functions of these proteins is not clear, however yeast ABC1 suppresses a cytoc 312260 Protein kinase domain 312261 Thymosin beta-4 family 312262 Protein kinase domain 312263 Raf-like Ras-binding domain 312263 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 312263 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 312269 Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues 312273 Trypsin 312275 Protein kinase domain 312275 Fibronectin type III domain 312275 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 312275 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 312279 Protein kinase domain 312279 Fibronectin type III domain 312279 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 312279 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 312281 7 transmembrane receptor (rhodopsin family) 312282 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 312282 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 312288 ENV polyprotein (coat polyprotein) 312299 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 312301 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 312301 ATP synthase, Delta/Epsilon chain, beta-sandwich domain. Part of the ATP synthase CF(1). These subunits are part of the head unit of the ATP synthase. The subunits are called delta and epsilon in human and metazoan species but in bacterial species the del 312302 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 312302 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 312303 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 312303 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 312303 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 312312 GTPase of unknown function 312314 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 312315 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 312316 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 312316 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge 312317 Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 312317 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous 312317 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge 312319 Ribosomal L29e protein family 312320 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 312320 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312321 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312325 Guanylate kinase 312327 Ribosomal protein L15 312331 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 312331 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 312333 Ribosomal protein L21e 312335 Homeobox domain 312336 Homeobox domain 312337 Homeobox domain 312338 Homeobox domain 312339 Homeobox domain 312343 Ribosomal L15 312346 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312350 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 312351 Serine carboxypeptidase 312357 SH2 domain 312359 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 312360 FKBP-type peptidyl-prolyl cis-trans isomerase 312361 PH domain. PH stands for pleckstrin homology 312363 Ribosomal protein L11, RNA binding domain 312363 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 312370 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 312372 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 312374 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312376 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 312377 7 transmembrane receptor (rhodopsin family) 312377 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 312379 7 transmembrane receptor (rhodopsin family) 312379 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 312380 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 312381 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 312382 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 312384 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 312385 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 312389 Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid p 312391 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 312391 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 312393 HIT family 312398 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 312398 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 312400 7 transmembrane receptor (rhodopsin family) 312403 HMG (high mobility group) box 312407 Fibronectin type III domain 312408 Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an 312409 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312411 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312413 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312415 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312416 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312418 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312419 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312420 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312421 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312423 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312424 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312425 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 312426 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312429 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312430 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312430 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 312431 Fibrillarin 312431 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 312436 Low molecular weight phosphotyrosine protein phosphatase 312438 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 312438 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 312440 jmjC domain 312440 jmjC domain 312442 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 312451 HMG (high mobility group) box 312465 Vinculin family 312477 PTB domain (IRS-1 type) 312478 Lysyl oxidase 312478 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 312479 Homeobox domain 312480 Zinc finger, C3HC4 type (RING finger) 312483 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 312483 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 312488 ATP synthase alpha/beta chain, C terminal domain 312488 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 312488 pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 312489 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 312492 C2 domain 312495 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 312495 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 312498 Exocyst complex subunit Sec15-like 312498 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 312501 Protein kinase domain 312502 C2 domain 312505 'Cold-shock' DNA-binding domain 312505 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 312507 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 312510 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312514 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 312516 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 312519 Protein kinase domain 312522 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 312526 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 312527 Uncharacterized ACR, COG2135 312529 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 312530 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 312532 Ribosomal protein L21e 312533 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 312535 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 312535 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 312536 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 312536 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 312537 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 312538 MCM2/3/5 family 312542 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 312544 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 312544 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 312545 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 312548 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 312549 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 312552 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 312558 wnt family 312560 DNA repair protein Rad4 312563 LIM domain. This family represents two copies of the LIM structural domain 312566 Thrombospondin type 1 domain 312566 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 312566 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 312568 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 312574 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312575 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312585 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 312589 Helix-loop-helix DNA-binding domain 312600 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 312601 Macrophage migration inhibitory factor (MIF) 312607 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 312607 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 312611 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 312612 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 312616 Poly A polymerase family. This family includes nucleic acid independent RNA polymerases, such as Poly(A) polymerase, which adds the poly (A) tail to mRNA EC:2.7.7.19. This family also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of 312617 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312617 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 312622 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 312626 Ribosomal protein S6e 312630 DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme cons 312636 CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain 312638 Giardia variant-specific surface protein 312647 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 312649 tRNA intron endonuclease, catalytic C-terminal domain. Members of this family cleave pre tRNA at the 5' and 3' splice sites to release the intron EC:3.1.27.9 312651 WD domain, G-beta repeat 312652 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 312652 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 312652 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 312657 7 transmembrane receptor (rhodopsin family) 312659 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 312663 Core histone H2A/H2B/H3/H4 312664 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 312666 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 312667 Intermediate filament protein 312667 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 312667 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 312667 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 312667 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 312667 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 312667 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 312668 Cache domain 312670 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 312674 wnt family 312676 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 312678 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 312681 Mitochondrial carrier protein 312681 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 312682 Apoptosis regulator proteins, Bcl-2 family 312686 Ubiquitin carboxyl-terminal hydrolase family 2 312687 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 312689 Cation-dependent mannose-6-phosphate receptor 312690 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 312695 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 312696 Lectin C-type domain. This family includes both long and short form C-type 312698 Lectin C-type domain. This family includes both long and short form C-type 312700 Trypsin 312700 CUB domain 312700 Sushi domain (SCR repeat) 312701 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 312704 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 312706 Suppressor Mra1. The suppressor Mra1 is found in high-copy-number when Ras1 is mutated, that recovers the mating deficiency caused by the decrease of Ras1 activity. Mutational analysis in yeast suggests that the suppressor Mra1 is essential for cell growt 312710 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 312714 von Willebrand factor type D domain 312721 Protein kinase domain 312723 Ribosomal protein S7e 312727 Tetraspanin family 312728 Tetraspanin family 312736 TEA/ATTS domain family 312738 TEA/ATTS domain family 312739 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 312740 Tub family 312742 Lectin C-type domain. This family includes both long and short form C-type 312744 Lectin C-type domain. This family includes both long and short form C-type 312745 Lectin C-type domain. This family includes both long and short form C-type 312748 Lectin C-type domain. This family includes both long and short form C-type 312750 'chromo' (CHRromatin Organization MOdifier) domain 312750 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 312751 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 312752 Lectin C-type domain. This family includes both long and short form C-type 312754 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 312756 Lectin C-type domain. This family includes both long and short form C-type 312758 Lectin C-type domain. This family includes both long and short form C-type 312759 Lectin C-type domain. This family includes both long and short form C-type 312760 Lectin C-type domain. This family includes both long and short form C-type 312762 Lectin C-type domain. This family includes both long and short form C-type 312763 Lectin C-type domain. This family includes both long and short form C-type 312777 Ets-domain 312777 Sterile alpha motif (SAM)/Pointed domain 312782 Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure 312787 7 transmembrane receptor (rhodopsin family) 312790 7 transmembrane receptor (metabotropic glutamate family) 312791 7 transmembrane receptor (metabotropic glutamate family) 312792 Ribosomal S17 312803 Core histone H2A/H2B/H3/H4 312804 Core histone H2A/H2B/H3/H4 312806 NAD:arginine ADP-ribosyltransferase 312807 RHO protein GDP dissociation inhibitor 312808 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 312811 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 312812 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 312816 Pentapeptide repeats (8 copies). These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknow 312818 LIM domain. This family represents two copies of the LIM structural domain 312824 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 312824 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 312826 Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <=> diphosphate + CMP-3-deoxy-D-manno-oct 312827 Sulfotransferase protein 312828 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 312828 Domain of unknown function (DUF227). This family includes a large number of drosophila proteins of unknown function. The family also includes several C. elegans proteins. The alignment contains many histidines and aspartates that are conserved, suggesting 312828 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 312839 Intermediate filament tail domain 312843 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 312843 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 312850 Protein kinase domain 312851 Helix-loop-helix DNA-binding domain 312855 Uncharacterized ACR, COG1579 312855 Intermediate filament protein 312855 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 312855 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 312855 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 312858 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 312858 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 312860 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 312866 Ribosomal family S4e 312869 Ribosomal L15 312872 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 312873 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 312874 Myosin head (motor domain) 312875 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 312879 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 312882 N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere 312882 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 312883 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 312884 Homeobox domain 312885 Lectin C-type domain. This family includes both long and short form C-type 312887 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 312890 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 312890 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 312893 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 312896 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 312896 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 312897 Helix-loop-helix DNA-binding domain 312903 Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation 312903 Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis 312906 ENV polyprotein (coat polyprotein) 312906 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 312907 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 312907 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 312912 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 312912 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 312913 Zinc carboxypeptidase 312913 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol 312915 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 312916 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 312917 Helix-loop-helix DNA-binding domain 312919 Iron-containing alcohol dehydrogenase 312921 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312923 PH domain. PH stands for pleckstrin homology 312923 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 312924 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 312925 TIP41-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 interacts with TAP42 and negatively regulates the TOR signaling pathway 312930 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 312932 Transcription factor S-II (TFIIS) 312933 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 312936 HMG (high mobility group) box 312937 Doublecortin 312938 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 312938 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 312939 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 312940 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 312946 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 312946 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 312947 Zinc-binding dehydrogenase 312947 Conserved hypothetical protein 95 312947 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 312947 O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production 312947 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 312948 Ribosomal protein L19e 312949 Fork head domain 312950 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 312952 Inositol monophosphatase family 312959 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 312959 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 312964 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 312965 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 312967 Beige/BEACH domain 312967 WD domain, G-beta repeat 312967 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 312974 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 312974 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 312974 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 312979 CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650 312980 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 312987 Ribosomal protein L31e 312990 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons 312993 Ribosomal protein L24e 312994 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 312996 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 312999 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 313000 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 313000 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 313001 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 313008 Ribosomal protein L44 313011 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 313011 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 313013 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 313013 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 313015 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 313017 14-3-3 protein 313017 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 313020 WD domain, G-beta repeat 313030 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 313034 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 313035 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 313038 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 313039 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 313039 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 313039 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 313041 HMG (high mobility group) box 313045 Intermediate filament protein 313045 KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif 313045 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 313045 PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins 313045 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 313047 tRNA synthetases class I (W and Y) 313047 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti 313048 WD domain, G-beta repeat 313049 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 313050 SH2 domain 313050 Protein kinase domain 313050 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 313051 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313052 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 313056 WD domain, G-beta repeat 313057 TMS membrane protein/tumour differentially expressed protein (TDE) 313058 Thrombospondin type 1 domain 313058 7 transmembrane receptor (rhodopsin family) 313058 7 transmembrane receptor (Secretin family) 313058 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 313058 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 313058 Paramyxovirus P/V phosphoprotein. Paramyxoviral P genes are able to generate more than one product, using alternative reading frames and RNA editing. The P gene encodes the structural phosphoprotein P. In addition, it encodes several non-structural protei 313063 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 313063 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 313063 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 313069 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313070 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313074 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 313074 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 313076 MYND finger 313087 C2 domain 313089 Amino acid permease 313090 ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae 313093 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 313098 Ubiquitin carboxyl-terminal hydrolase family 2 313099 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 313101 Ribosomal protein L34e 313109 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 313110 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 313111 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 313112 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 313114 Ribosomal S3Ae family 313121 Protein kinase domain 313125 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 313126 Connexin 313129 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 313130 Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases 313131 mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme 313131 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 313132 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 313133 mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme 313134 Ribosomal protein L31e 313139 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 313141 L1 transposable element 313142 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 313149 ENV polyprotein (coat polyprotein) 313149 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 313149 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 313149 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 313152 Interferon alpha/beta domain 313153 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 313156 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 313163 ADP-ribosylation factor family 313163 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 313166 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 313167 Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap 313168 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 313169 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 313169 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 313170 Kinesin motor domain 313171 WD domain, G-beta repeat 313177 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 313182 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 313183 Actin 313184 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 313191 Sushi domain (SCR repeat) 313191 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 313192 von Willebrand factor type A domain 313197 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 313200 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 313200 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 313206 HIT family 313208 Fibronectin type III domain 313208 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 313210 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 313212 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 313213 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 313213 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 313216 Zinc finger, C3HC4 type (RING finger) 313219 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 313228 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 313229 Calsequestrin 313229 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 313231 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 313233 QXW lectin repeat 313233 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 313234 PH domain. PH stands for pleckstrin homology 313234 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 313235 WD domain, G-beta repeat 313236 B-box zinc finger 313236 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 313240 SH2 domain 313240 Ribosomal protein L31e 313241 Mitochondrial carrier protein 313242 WD domain, G-beta repeat 313244 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 313247 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 313249 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 313254 Kinesin motor domain 313255 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 313256 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 313258 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 313259 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 313264 Zinc finger, C3HC4 type (RING finger) 313267 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 313270 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 313270 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 313273 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 313274 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 313276 jmjC domain 313276 Protein kinase C terminal domain 313277 jmjC domain 313278 Fibronectin type III domain 313278 Protein-tyrosine phosphatase 313283 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 313284 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 313296 WD domain, G-beta repeat 313297 HMG (high mobility group) box 313303 SH2 domain 313304 Stromal antigen (SA/STAG) protein 313310 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 313312 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 313313 HMG (high mobility group) box 313313 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 313317 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 313318 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 313318 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 313322 Calx-beta domain 313322 DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerbe 313323 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 313323 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 313331 Skp1 family, tetramerisation domain 313331 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 313332 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 313333 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 313339 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 313340 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 313340 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 313340 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 313341 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 313343 YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity 313348 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 313348 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 313350 Phosphorylase family 2 313352 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 313352 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 313353 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 313355 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313357 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313362 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 313364 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 313368 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 313372 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 313373 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 313373 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 313374 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 313375 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 313375 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 313379 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 313382 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 313383 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 313385 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 313386 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 313387 Ubiquitin carboxyl-terminal hydrolase family 2 313387 Ubiquitin carboxyl-terminal hydrolases family 2 313389 Fibrinogen beta and gamma chains, C-terminal globular domain 313391 Peptidase family C54 313393 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 313399 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 313409 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 313410 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 313410 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 313415 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 313416 WD domain, G-beta repeat 313418 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 313418 ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL 313421 Thrombospondin type 1 domain 313421 Low-density lipoprotein receptor domain class A 313421 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl 313426 Ribosomal protein L21e 313429 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 313429 tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt 313438 PH domain. PH stands for pleckstrin homology 313439 Ribosomal protein S19 313443 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 313443 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 313444 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 313445 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 313447 Ribosomal protein S26e 313449 Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3 313450 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 313451 Homeobox domain 313452 Homeobox domain 313454 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 313455 Homeobox domain 313467 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313471 DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown 313471 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 313472 Enolase, N-terminal domain 313472 Enolase, C-terminal TIM barrel domain 313474 Uncharacterized ACR, COG1579 313474 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 313474 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 313476 Calreticulin family 313477 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 313478 C2 domain 313479 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 313479 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 313480 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 313484 DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai 313487 SCP-like extracellular protein. This domain is also found in prokaryotes 313491 HIT family 313492 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 313495 Eukaryotic-type carbonic anhydrase 313499 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 313504 Fork head domain 313505 Fork head domain 313507 Helix-loop-helix DNA-binding domain 313508 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 313509 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 313511 Protein kinase domain 313511 Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc 313512 Homeobox domain 313513 Ribosomal protein S5, C-terminal domain 313518 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 313522 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 313524 Eukaryotic ribosomal protein L18 313525 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 313526 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 313527 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 313533 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 313535 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 313535 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 313538 Transforming growth factor beta like domain 313539 jmjN domain 313539 jmjC domain 313539 Ribosomal protein L19e 313540 Protein kinase domain 313543 Protein kinase domain 313546 7 transmembrane receptor (rhodopsin family) 313547 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 313548 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 313553 Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosy 313558 Fork head domain 313560 Glutamine amidotransferase class-I 313564 Peptidase family M48 313567 Palmitoyl protein thioesterase 313568 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 313569 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 313572 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 313573 Coenzyme A transferase 313575 Helix-loop-helix DNA-binding domain 313576 Coenzyme A transferase 313576 Transforming growth factor beta like domain 313576 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 313577 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 313581 Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This 313582 LIM domain. This family represents two copies of the LIM structural domain 313588 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 313592 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 313594 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 313596 Transcription factor AP-2 313599 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 313600 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 313600 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 313601 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 313602 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 313608 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 313609 Protein kinase domain 313610 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mos 313611 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 313611 Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl 313614 Ribosomal L10 313615 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 313617 Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2) 313618 Intermediate filament protein 313618 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 313618 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 313618 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 313625 Tissue factor 313625 Fibronectin type III domain 313625 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 313626 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313629 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313632 Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti 313633 Protein kinase domain 313633 Fibronectin type III domain 313633 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 313635 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 313636 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 313637 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 313641 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 313641 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 313644 Rap/ran-GAP 313647 linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types 313648 Dolichyl-diphosphooligosaccharide-protein glycosyltransferase 48kD subunit. Members of this family are involved in asparagine-linked protein glycosylation. In particular, dolichyl-diphosphooligosaccharide-protein glycosyltransferase (DDOST), also known as 313649 Cytidine and deoxycytidylate deaminase zinc-binding region 313652 Ribosomal protein L11, RNA binding domain 313652 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 313653 von Willebrand factor type A domain 313654 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 313658 Putative zinc finger in N-recognin 313659 Homeobox domain 313659 'Paired box' domain 313665 Hsp20/alpha crystallin family 313666 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 313667 PH domain. PH stands for pleckstrin homology 313667 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 313668 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 313670 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 313672 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 313676 ribosomal L5P family C-terminus. This region is found associated with pfam00281 313679 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 313680 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 313682 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 313685 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 313687 Ribosomal family S4e 313687 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 313687 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 313688 Carboxylesterase 313689 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 313692 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 313693 Ribosomal protein L6 313694 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 313697 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 313698 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 313702 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 313703 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated 313705 AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance 313705 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 313706 UbiA prenyltransferase family 313707 HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease 313707 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin 313708 Ribosomal protein S5, C-terminal domain 313714 Ribosomal L29e protein family 313721 Ribosomal protein S5, C-terminal domain 313721 Ribosomal protein S5, N-terminal domain 313722 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 313723 Ribosomal protein S5, N-terminal domain 313725 7 transmembrane receptor (Secretin family) 313726 Sugar (and other) transporter 313734 CG-1 domain. CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein. CG-1 domains are asso 313735 Disulfide bond formation protein DsbB. This family consists of disulfide bond formation protein DsbB from bacteria. The DsbB protein oxidizes the periplasmic protein DsbA which in turn oxidizes cysteines in other periplasmic proteins in order to make disu 313741 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 313741 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 313742 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 313743 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 313743 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 313745 Death domain 313745 PH domain. PH stands for pleckstrin homology 313745 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 313748 Isoprenylcysteine carboxyl methyltransferase (ICMT) family. The isoprenylcysteine o-methyltransferase (EC:2.1.1.100) family carry out carboyxl methylation of cleaved eukaryotic proteins that terminate in a CaaX motif. In Saccharomyces cerevisiae this meth 313755 Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be 313756 C2 domain 313756 PH domain. PH stands for pleckstrin homology 313756 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 313756 Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit 313756 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 313760 Caldesmon 313760 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 313767 Elongation factor 1 gamma, conserved domain 313767 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 313767 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 313772 PH domain. PH stands for pleckstrin homology 313772 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 313772 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 313777 Uncharacterised protein family (UPF0120) 313779 Ribosomal protein S2 313780 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 313782 F-box domain 313783 Olfactomedin-like domain 313784 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 313786 2Fe-2S iron-sulfur cluster binding domain 313787 HMG (high mobility group) box 313790 Uncharacterized ACR, YfiH family COG1496 313799 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 313800 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 313804 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 313804 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 313806 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 313811 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 313811 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 313813 ENV polyprotein (coat polyprotein) 313813 Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus 313815 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 313817 MCM2/3/5 family 313817 Magnesium chelatase, subunit ChlI. Magnesium-chelatase is a three-component enzyme that catalyses the insertion of Mg2+ into protoporphyrin IX. This is the first unique step in the synthesis of (bacterio)chlorophyll. Due to this, it is thought that Mg-che 313819 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 313820 Protein kinase domain 313823 Phosphotyrosine interaction domain (PTB/PID) 313827 Core histone H2A/H2B/H3/H4 313829 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 313830 LCCL domain 313830 von Willebrand factor type A domain 313831 LCCL domain 313831 von Willebrand factor type A domain 313833 Sulfotransferase protein 313834 Protein kinase domain 313834 PH domain. PH stands for pleckstrin homology 313837 Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally imp 313841 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 313842 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313843 Aldose 1-epimerase 313844 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 313845 PH domain. PH stands for pleckstrin homology 313845 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 313846 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 313847 Protein kinase domain 313851 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 313858 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 313858 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 313860 Protein kinase domain 313861 WD domain, G-beta repeat 313861 Translationally controlled tumor protein 313861 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes. 313862 ENV polyprotein (coat polyprotein) 313866 PH domain. PH stands for pleckstrin homology 313866 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 313866 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 313868 Ribosomal protein L31e 313868 Ribosomal protein L14p/L23e 313874 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 313874 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 313878 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 313879 Calpain family cysteine protease 313880 Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by 313881 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 313889 Enolase, C-terminal TIM barrel domain 313907 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 313910 Protein kinase domain 313911 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 313912 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 313917 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 313918 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 313921 RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to 313925 Ribosomal protein S6e 313925 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 313927 C-5 cytosine-specific DNA methylase 313927 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 313935 C2 domain 313935 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 313939 Sigma-54 interaction domain 313939 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 313940 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 313946 Ribosomal protein L13e 313950 PX domain. PX domains bind to phosphoinositides 313954 von Willebrand factor type A domain 313957 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313960 Helix-loop-helix DNA-binding domain 313961 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 313961 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 313961 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 313970 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 313970 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 313974 Protein kinase domain 313976 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 313978 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 313983 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 313986 HMG (high mobility group) box 313987 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313991 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 313994 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 313997 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 314004 Thymidylate kinase 314009 Ribosomal protein L21e 314010 Lectin C-type domain. This family includes both long and short form C-type 314010 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 314011 Caulimovirus viroplasmin. This family consists of various caulimovirus viroplasmin proteins. The viroplasmin protein is encoded by gene VI and is the main component of viral inclusion bodies or viroplasms. Inclusions are the site of viral assembly, DNA sy 314014 WD domain, G-beta repeat 314016 Animal haem peroxidase 314016 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 314019 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 314030 Ribosomal protein L35Ae 314035 Ribosomal L22e protein family 314037 Ribosomal protein S2 314039 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 314040 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 314041 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 314046 MYND finger 314046 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 314047 Ribosomal protein S26e 314050 Adenosine-deaminase (editase) domain 314054 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 314063 Core histone H2A/H2B/H3/H4 314071 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 314075 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of 314092 Protein kinase domain 314095 LCCL domain 314095 von Willebrand factor type A domain 314099 SH2 domain 314108 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 314110 Elongation factor 1 gamma, conserved domain 314110 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 314110 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 314111 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 314112 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 314113 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 314114 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 314117 Hsp90 protein 314117 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 314118 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 314118 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 314118 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 314121 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 314123 Aminotransferase class I and II 314124 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314124 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314125 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 314126 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 314126 DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors) 314126 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 314129 Ribosomal L29e protein family 314131 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 314138 'Paired box' domain 314140 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 314144 Ribosomal protein L15 314147 Ribosomal protein S6e 314148 Lectin C-type domain. This family includes both long and short form C-type 314151 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 314151 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 314153 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 314154 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 314157 F-box domain 314160 ENV polyprotein (coat polyprotein) 314164 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 314165 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 314166 HMG (high mobility group) box 314173 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 314174 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 314176 Core histone H2A/H2B/H3/H4 314178 Ribosomal protein L21e 314179 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 314180 MAM domain. An extracellular domain found in many receptors 314181 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314181 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314182 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314184 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314184 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314191 NAC domain 314193 Protein kinase domain 314194 Protein kinase domain 314195 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 314198 Protein kinase domain 314200 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 314213 7 transmembrane receptor (rhodopsin family) 314213 7 transmembrane receptor (rhodopsin family) 314215 Ribosomal protein L6 314216 Cyclophilin type peptidyl-prolyl cis-trans isomerase 314218 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 314221 Homeobox domain 314222 Ribosomal protein L23 314223 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 314224 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo 314225 C2 domain 314245 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 314246 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 314248 Ribosomal S17 314249 PH domain. PH stands for pleckstrin homology 314249 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 314250 PH domain. PH stands for pleckstrin homology 314250 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 314253 Ribosomal protein S7e 314254 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 314258 Adenylate kinase 314259 Guanylate kinase 314259 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 314259 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 314260 PH domain. PH stands for pleckstrin homology 314260 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 314262 PH domain. PH stands for pleckstrin homology 314262 MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins 314264 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 314273 WD domain, G-beta repeat 314274 Ribosomal protein L44 314275 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 314283 Disintegrin 314283 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 314283 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 314293 Ribosomal L39 protein 314296 PWI domain 314297 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314302 Cyclin-dependent kinase regulatory subunit 314304 Dienelactone hydrolase family 314309 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314309 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314312 GDA1/CD39 (nucleoside phosphatase) family 314313 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 314317 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 314317 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 314322 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 314328 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 314332 Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length 314333 Aminotransferase class I and II 314333 Orn/Lys/Arg decarboxylase, major domain 314334 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 314336 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 314336 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 314337 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 314337 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 314345 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314345 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314349 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 314352 Fibronectin type II domain 314353 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 314356 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 314360 Glycosyl hydrolase family 59 314374 Fork head domain 314384 Protein kinase domain 314384 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 314386 7 transmembrane receptor (rhodopsin family) 314389 Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer 314391 C2 domain 314396 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells 314397 Vacuolar sorting protein 9 (VPS9) domain 314400 Enolase, N-terminal domain 314400 Enolase, C-terminal TIM barrel domain 314400 Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain 314406 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 314407 Importin-beta N-terminal domain 314416 Adenylate kinase 314427 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 314430 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 314431 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 314432 Repeat in ubiquitin-activating (UBA) protein 314432 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 314434 Ribosomal protein L6 314435 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314436 Ets-domain 314438 Fatty acid desaturase 314439 'Cold-shock' DNA-binding domain 314441 Mitochondrial carrier protein 314442 WHEP-TRS domain 314442 tRNA synthetases class I (W and Y) 314453 WD domain, G-beta repeat 314457 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 314458 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 314461 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 314462 Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) 314464 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 314465 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 314465 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 314466 Uncharacterized ACR, YggU family COG1872 314468 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 314473 Kinesin motor domain 314480 7 transmembrane receptor (rhodopsin family) 314482 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 314483 Transcription factor TFIIB repeat 314491 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314492 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314493 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314494 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314495 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314496 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314497 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314498 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314499 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314500 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314501 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314503 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314504 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314507 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314509 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314510 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314511 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314512 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314513 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 314514 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 314515 7 transmembrane receptor (rhodopsin family) 314515 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314516 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314518 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314521 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314524 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 314527 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 314532 Ribosomal protein L19e 314537 Ribosomal protein L36e 314537 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 314537 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 314538 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 314540 Hsp90 protein 314540 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 314543 Phosphotyrosine interaction domain (PTB/PID) 314543 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 314543 Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome 314544 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 314547 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 314548 Glycoprotease family 314548 ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER 314549 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 314550 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 314551 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 314553 Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phosp 314555 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 314556 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 314560 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 314563 Helix-loop-helix DNA-binding domain 314566 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 314569 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 314570 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 314570 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 314574 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 314575 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 314582 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 314584 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314586 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314587 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314588 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314589 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 314590 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314591 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314592 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314593 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 314594 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 314596 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 314600 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314601 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314602 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314603 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314605 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314605 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314606 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 314608 7 transmembrane receptor (rhodopsin family) 314610 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314612 SH2 domain 314612 Phosphotyrosine interaction domain (PTB/PID) 314613 Zinc finger, C3HC4 type (RING finger) 314614 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 314614 PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2 314615 Trypsin 314616 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 314617 WD domain, G-beta repeat 314618 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 314618 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 314621 Protein kinase domain 314623 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 314624 Ephrin 314625 TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological 314626 NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and 314627 Protein kinase domain 314627 POLO box duplicated region 314629 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 314633 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 314634 PH domain. PH stands for pleckstrin homology 314635 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 314637 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 314638 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 314640 Guanylate kinase 314640 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 314647 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 314649 7 transmembrane receptor (rhodopsin family) 314651 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 314652 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 314653 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 314654 Myosin head (motor domain) 314654 RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses 314656 Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration 314658 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 314659 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 314660 HMG (high mobility group) box 314661 ENV polyprotein (coat polyprotein) 314661 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 314663 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 314669 Protein kinase domain 314669 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 314669 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 314670 Protein kinase domain 314674 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 314675 WD domain, G-beta repeat 314675 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 314675 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 314676 DNA photolyase. This domain binds a light harvesting cofactor 314676 Antifreeze-like domain. This family contains type III antifreeze proteins as well as a variety of enzymes. This domain is presumed to be involved in sugar binding in the enzyme proteins 314676 NeuB family. NeuB is the prokaryotic N-acetylneuraminic acid (Neu5Ac) synthase. It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only 314678 LIM domain. This family represents two copies of the LIM structural domain 314681 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 314682 HMG (high mobility group) box 314688 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 314691 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 314692 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples hav 314694 Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin 314706 Insulin/IGF/Relaxin family. Superfamily includes insulins 314709 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314709 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314711 Ets-domain 314712 ADP-ribosylation factor family 314712 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 314712 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 314715 Ribosomal protein L6 314716 Growth-Arrest-Specific Protein 2 Domain 314716 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 314717 Protein kinase domain 314721 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 314723 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 314727 Phosphotyrosine interaction domain (PTB/PID) 314728 Phosphotyrosine interaction domain (PTB/PID) 314730 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 314733 Ribosomal protein S19e 314743 Protein kinase domain 314745 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 314746 Ubiquitin carboxyl-terminal hydrolase family 2 314746 Ubiquitin carboxyl-terminal hydrolases family 2 314746 Zn-finger in ubiquitin-hydrolases and other protein 314747 PH domain. PH stands for pleckstrin homology 314747 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 314747 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 314754 Death domain 314756 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 314764 Ribosomal protein L6e 314764 Ribosomal protein L6, N-terminal domain 314764 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 314766 Ribosomal L15 314767 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314767 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314771 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 314776 Cyclophilin type peptidyl-prolyl cis-trans isomerase 314777 QXW lectin repeat 314777 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 314778 WD domain, G-beta repeat 314782 Eukaryotic porin 314785 TPR Domain 314795 TPR Domain 314800 AMP-binding enzyme 314803 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 314805 C2 domain 314810 HMG (high mobility group) box 314812 Ribosomal protein L31e 314814 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 314818 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 314829 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 314834 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 314844 Ribosomal protein L11, RNA binding domain 314844 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 314845 Core histone H2A/H2B/H3/H4 314847 Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown 314850 PTB domain (IRS-1 type) 314853 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 314854 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 314855 Zinc carboxypeptidase 314856 Zn-finger in Ran binding protein and others 314856 p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding dom 314860 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 314862 Protein kinase domain 314870 Death domain 314870 Protein kinase domain 314870 K+ potassium transporter. This is a family of K+ potassium transporters that are conserved across phyla, having both bacterial (KUP), yeast (HAK), and plant (AtKT) sequences as members 314871 Uncharacterized protein family UPF0005 314874 SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad 314879 Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl 314896 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 314902 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 314903 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 314903 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 314904 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 314907 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 314908 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 314908 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 314909 SH2 domain 314909 STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. Th 314909 STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fac 314912 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 314912 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 314927 Ribosomal protein L31e 314940 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 314941 CUB domain 314942 CUB domain 314942 Sushi domain (SCR repeat) 314943 Sushi domain (SCR repeat) 314943 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 314944 CUB domain 314944 Thymosin beta-4 family 314946 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do 314950 Ribosomal S3Ae family 314953 Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mos 314954 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314954 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314961 PH domain. PH stands for pleckstrin homology 314961 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 314961 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 314965 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 314969 NOL1/NOP2/sun family 314969 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 314972 Ribosomal protein L19e 314975 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 314975 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 314977 Core histone H2A/H2B/H3/H4 314979 Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains 314981 Fibronectin type III domain 314981 von Willebrand factor type A domain 314981 Thrombospondin N-terminal -like domain 314988 Homeobox domain 314990 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 314990 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 314993 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 314993 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 314995 Ribosomal protein S27 314996 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 314996 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 314997 C2 domain 314999 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 315000 Zinc finger, C3HC4 type (RING finger) 315001 TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme 315005 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 315007 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 315007 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 315009 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 315023 Mitochondrial carrier protein 315027 pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem 315033 TPR Domain 315035 Ribosomal protein S24e 315038 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 315040 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 315042 Protein kinase domain 315046 HMG (high mobility group) box 315047 ENV polyprotein (coat polyprotein) 315047 Golgi 4-transmembrane spanning transporter 315051 Ribosomal L29e protein family 315055 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 315055 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 315065 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 315073 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di 315073 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 315074 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 315075 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 315076 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 315077 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 315078 Ribosomal protein L19e 315079 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 315081 Transposase. Transposase proteins are necessary for efficient DNA transposition. Contains transposases for IS204, IS1001, IS1096 and IS1165 315083 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 315083 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 315085 Nicotinate phosphoribosyltransferase (NAPRTase). Nicotinate phosphoribosyltransferase (EC:2.4.2.11) is the rate limiting enzyme that catalyses the first reaction in the NAD salvage synthesis 315087 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315088 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315089 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 315090 Protein kinase domain 315093 WD domain, G-beta repeat 315094 Domain of unknown function (DUF384). domo keywords :chromosome c26f1.12c 41.3 315094 Domain of unknown function (DUF383). domo keywords :chromosome c26f1.12c 41.3 315095 Protein kinase domain 315096 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 315100 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315101 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315103 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315108 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 315109 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 315112 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 315112 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 315114 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 315114 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 315115 PH domain. PH stands for pleckstrin homology 315115 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 315117 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 315119 Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs 315122 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 315124 PH domain. PH stands for pleckstrin homology 315125 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 315131 ER lumen protein retaining receptor 315132 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 315133 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 315133 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 315134 Josephin 315135 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 315138 'chromo' (CHRromatin Organization MOdifier) domain 315141 Mab-21 protein 315143 Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration 315144 SH2 domain 315144 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 315147 Ribosomal L29e protein family 315150 Lyase 315151 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 315151 RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impli 315154 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 315155 metallopeptidase family M24 315156 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 315156 TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1 315156 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 315156 KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun 315159 BTG1 family. A novel family of anti-proliferative proteins 315165 Melibiase 315170 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315173 Acyl transferase domain 315174 CUB domain 315178 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 315180 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 315183 Sulfotransferase protein 315185 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 315186 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 315188 NAC domain 315196 wnt family 315203 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 315205 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 315205 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 315206 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 315210 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 315210 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 315210 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 315211 pfam02892, zf-BED, BED zinc finger 315214 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 315214 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 315216 Uncharacterized ACR, YdiU/UPF0061 family 315219 Glycosyl transferase family, a/b domain. This family includes anthranilate phosphoribosyltransferase (TrpD), thymidine phosphorylase. All these proteins can transfer a phosphorylated ribose substrate 315219 pfam02885, Glycos_trans_3N, Glycosyl transferase family, helical bundle domain. This family includes anthranilate phosphoribosyltransferase (TrpD), thymidine phosphorylase. All these proteins can transfer a phosphorylated ribose substrate 315219 SCO1/SenC. This family is involved in biogenesis of respiratory and photosynthetic systems. SCO1 is required for a post-translational step in the accumulation of subunits COXI and COXII of cytochrome c oxidase. SenC is required for optimal cytochrome c ox 315220 Phosphotyrosine interaction domain (PTB/PID) 315222 Sulfatase 315223 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 315223 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 315230 C2 domain 315231 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 315231 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 315234 Sugar (and other) transporter 315237 Sugar (and other) transporter 315238 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 315241 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 315248 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 315250 Ribosomal protein L15 315252 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 315254 Ribosomal protein L35Ae 315255 Zn-finger in Ran binding protein and others 315255 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 315259 LIM domain. This family represents two copies of the LIM structural domain 315263 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 315263 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 315264 Eukaryotic ribosomal protein L18 315264 Uncharacterized protein family UPF0024 315265 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 315284 Somatomedin B domain 315287 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 315289 7 transmembrane receptor (rhodopsin family) 315290 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315291 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 315292 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 315294 wnt family 315298 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 315298 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 315299 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 315299 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 315303 AMP-binding enzyme 315305 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 315305 pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB 315306 ubiE/COQ5 methyltransferase family 315310 QXW lectin repeat 315310 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 315311 Ribosomal protein L14p/L23e 315311 HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3- 315312 LIM domain. This family represents two copies of the LIM structural domain 315318 Intermediate filament protein 315318 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 315319 Intermediate filament protein 315321 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 315323 Intermediate filament protein 315323 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 315324 Intermediate filament protein 315324 ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit 315324 Vps53-like, N-terminal domain. Vps53 complexes with Vps52 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 315324 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 315324 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 315324 IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation 315324 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 315326 SH2 domain 315326 Phosphotyrosine interaction domain (PTB/PID) 315327 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 315330 Peptidase family C50 315333 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 315337 Homeobox domain 315337 Tubulin binding cofactor A 315338 Homeobox domain 315341 Homeobox domain 315344 Uracil DNA glycosylase superfamily 315344 Uracil DNA glycosylase superfamily 315345 Clathrin adaptor complex small chain 315345 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 315350 PTB domain (IRS-1 type) 315350 PH domain. PH stands for pleckstrin homology 315352 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 315358 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 315358 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 315360 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 315362 Protein kinase domain 315363 Ribosomal protein S19e 315364 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315365 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 315369 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 315371 Cyclophilin type peptidyl-prolyl cis-trans isomerase 315372 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 315374 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 315378 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 315379 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 315380 HIT family 315382 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 315385 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 315387 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di 315387 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 315388 Trypsin 315389 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 315389 TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1 315389 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 315389 KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun 315394 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 315394 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 315395 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 315398 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 315399 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 315399 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 315400 Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors 315404 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 315405 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 315406 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 315406 OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata 315407 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 315407 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 315407 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 315409 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 315412 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315417 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 315429 jmjN domain 315429 jmjC domain 315430 7 transmembrane receptor (rhodopsin family) 315432 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 315432 PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the 315434 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 315435 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 315435 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 315436 Ribosomal protein S6e 315437 Virulence determinant. The UK protein is an African swine fever virus (ASFV) protein that is highly conserved amongst strains, and is an important viral virulence determinant for domestic pigs 315438 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 315439 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 315442 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 315444 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 315452 7 transmembrane receptor (rhodopsin family) 315455 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315457 7 transmembrane receptor (rhodopsin family) 315458 7 transmembrane receptor (rhodopsin family) 315458 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 315459 7 transmembrane receptor (rhodopsin family) 315461 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 315467 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315469 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315470 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315471 Ribosomal L10 315478 PX domain. PX domains bind to phosphoinositides 315478 PXA domain. This domain is associated with PX domains pfam00787 315479 Elongation factor 1 gamma, conserved domain 315479 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 315479 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 315484 Proteasome A-type and B-type 315487 Rad17 cell cycle checkpoint protein 315487 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 315492 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 315492 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 315493 Intermediate filament protein 315497 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 315504 Glycosyl hydrolases family 35 315505 Ribosomal protein L23 315506 Glycosyl hydrolases family 35 315507 Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold 315507 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 315508 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 315509 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 315510 Fibronectin type III domain 315510 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 315521 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 315527 Homeobox domain 315530 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 315532 Ets-domain 315533 Sterile alpha motif (SAM)/Pointed domain 315534 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 315534 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 315545 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 315546 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 315547 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 315548 SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins 315548 ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems 315548 Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of t 315550 RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to 315554 tRNA pseudouridine synthase. Involved in the formation of pseudouridine at the anticodon stem and loop of transfer-RNAs Pseudouridine is an isomer of uridine (5-(beta-D-ribofuranosyl) uracil, and id the most abundant modified nucleoside found in all cellu 315558 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315559 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 315568 7 transmembrane receptor (rhodopsin family) 315570 7 transmembrane receptor (rhodopsin family) 315571 7 transmembrane receptor (rhodopsin family) 315572 7 transmembrane receptor (rhodopsin family) 315573 7 transmembrane receptor (rhodopsin family) 315575 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 315575 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315575 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 315576 ENV polyprotein (coat polyprotein) 315579 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 315579 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 315583 Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix bu 315597 CUB domain 315597 Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines 315598 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 315600 Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain 315604 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 315604 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 315606 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 315618 Protein kinase domain 315634 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315638 Fibronectin type III domain 315639 Ubiquitin carboxyl-terminal hydrolase family 2 315639 Ubiquitin carboxyl-terminal hydrolases family 2 315642 Ribosomal protein L15 315643 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 315643 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 315644 Ribosomal protein S15 315644 Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequence 315646 Ferredoxin-fold anticodon binding domain. This is the anticodon binding domain found in some phenylalanyl tRNA synthetases. The domain has a ferredoxin fold 315647 Ribosomal protein L15 315648 HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats 315648 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 315650 BTG1 family. A novel family of anti-proliferative proteins 315655 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 315655 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 315656 Ribosomal protein S19e 315661 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 315661 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 315664 Filamin/ABP280 repeat 315668 ADP-ribosylation factor family 315668 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 315673 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 315675 Olfactomedin-like domain 315675 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 315675 Olfactomedin-like domain 315675 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 315681 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 315685 Protein kinase domain 315689 HMG (high mobility group) box 315691 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 315696 PX domain. PX domains bind to phosphoinositides 315696 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 315699 Protein-tyrosine phosphatase 315699 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 315704 Flavoprotein. This family contains diverse flavoprotein enzymes. This family includes epidermin biosynthesis protein, EpiD, which has been shown to be a flavoprotein that binds FMN. This enzyme catalyzes the removal of two reducing equivalents from the cy 315705 Ribosomal S3Ae family 315707 SH2 domain 315707 Protein kinase domain 315708 YjeF-related protein N-terminus 315710 Single-strand binding protein family 315712 C2 domain 315714 Lysyl oxidase 315716 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 315723 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 315729 Ribosomal protein S5, C-terminal domain 315731 Ribosomal protein L21e 315732 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 315732 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 315740 Kinesin motor domain 315740 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 315741 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 315744 von Willebrand factor type A domain 315745 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 315748 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 315750 Glycosyl hydrolase family 1 315755 Thymosin beta-4 family 315756 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 315756 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 315756 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 315758 Prolyl oligopeptidase family 315758 Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site 315759 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 315762 ADP-ribosylation factor family 315762 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 315763 Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate defor 315764 PH domain. PH stands for pleckstrin homology 315765 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315768 Ribosomal protein S5, C-terminal domain 315768 Ribosomal protein S5, N-terminal domain 315771 WD domain, G-beta repeat 315771 Regulator of chromosome condensation (RCC1) 315771 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 315771 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 315773 Ubiquitin carboxyl-terminal hydrolases family 2 315776 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 315776 I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin 315778 Ribosomal protein L44 315781 Peptidase family C54 315786 Phosphorylase family 2 315788 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 315792 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315793 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 315793 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 315795 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 315803 Ribosomal L38e protein family 315805 Ribosomal protein L21e 315806 Fibronectin type III domain 315806 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 315807 Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability 315809 Ribosomal L29e protein family 315811 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 315813 C2 domain 315819 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 315822 Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription 315824 Transforming growth factor beta like domain 315824 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 315826 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 315827 HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase 315828 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 315833 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 315833 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 315836 Ribosomal protein L11, RNA binding domain 315836 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 315840 Myosin head (motor domain) 315843 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 315844 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellul 315845 Hsp90 protein 315851 GNS1/SUR4 family 315852 Protein kinase domain 315858 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 315860 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 315866 Trypsin 315868 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 315870 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 315871 PXA domain. This domain is associated with PX domains pfam00787 315872 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 315876 PH domain. PH stands for pleckstrin homology 315877 Guanine nucleotide exchange factor for Ras-like GTPases 315877 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 315877 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 315878 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 315879 Thrombospondin type 1 domain 315879 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 315879 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 315880 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 315883 Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved 315887 Ribosomal protein S5, N-terminal domain 315887 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 315889 ENV polyprotein (coat polyprotein) 315889 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 315901 Sulfotransferase protein 315903 Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding 315903 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315904 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 315907 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 315908 UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p 315909 14-3-3 protein 315910 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 315911 Zinc finger, C3HC4 type (RING finger) 315911 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 315912 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 315912 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 315914 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 315918 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 315919 Ribosomal protein S2 315919 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 315923 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 315926 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 315926 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 315932 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 315935 C2 domain 315936 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 315939 Histidine acid phosphatase 315941 Mitochondrial carrier protein 315946 Nucleoside diphosphate kinase 315947 Nucleoside diphosphate kinase 315947 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 315950 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 315953 PMP-22/EMP/MP20/Claudin family 315954 S25 ribosomal protein 315955 Tissue factor 315957 Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region 315958 Stromal antigen (SA/STAG) protein 315968 ADP-ribosylation factor family 315968 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 315969 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 315971 HMG (high mobility group) box 315973 Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold 315973 Acyl-CoA dehydrogenase, N-terminal domain. The N-terminal domain of Acyl-CoA dehydrogenase is an all-alpha domain 315973 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 315974 7 transmembrane receptor (rhodopsin family) 315975 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 315978 Protein kinase domain 315979 von Willebrand factor type A domain 315979 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 315980 von Willebrand factor type A domain 315981 von Willebrand factor type A domain 315982 WD domain, G-beta repeat 315988 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 315993 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 315994 Protein kinase domain 315995 Ribosomal protein S2 315996 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 315997 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 315999 SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs 316000 Actin 316002 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 316005 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 316006 Regulatory subunit of type II PKA R-subunit 316009 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 316009 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 316009 Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p 316010 Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins 316012 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 316012 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 316014 Beige/BEACH domain 316014 WD domain, G-beta repeat 316015 Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA re 316016 Trypsin 316017 Vacuolar sorting protein 9 (VPS9) domain 316017 MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000 316019 7 transmembrane receptor (rhodopsin family) 316022 UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t 316023 Doublecortin 316023 Protein kinase domain 316028 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 316033 Glycosyl hydrolases family 35 316033 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 316045 Eukaryotic porin 316048 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 316051 Intermediate filament protein 316051 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 316052 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 316053 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 316055 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 316061 Sugar (and other) transporter 316062 Sugar (and other) transporter 316064 Protein kinase domain 316065 Ribosomal protein L10 316066 Protein kinase domain 316067 FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain 316067 FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain 316068 Ribosomal protein S27 316069 Ribosomal protein S27 316073 Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round 316075 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 316077 GDA1/CD39 (nucleoside phosphatase) family 316077 GDA1/CD39 (nucleoside phosphatase) family 316077 GDA1/CD39 (nucleoside phosphatase) family 316077 GDA1/CD39 (nucleoside phosphatase) family 316080 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 316086 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 316087 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 316087 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 316088 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 316088 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 316090 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 316094 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 316095 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 316095 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 316096 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 316097 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 316098 3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 316098 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase 316099 Lectin C-type domain. This family includes both long and short form C-type 316101 LIM domain. This family represents two copies of the LIM structural domain 316103 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 316105 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 316106 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 316107 7 transmembrane receptor (rhodopsin family) 316107 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 316109 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 316110 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 316110 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 316113 Helix-hairpin-helix motif 316118 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 316120 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 316120 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 316122 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 316124 AMP-binding enzyme 316126 RanBP1 domain 316129 Zinc finger, C3HC4 type (RING finger) 316129 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 316129 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 316129 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 316130 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 316131 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 316132 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 316134 ENV polyprotein (coat polyprotein) 316137 7 transmembrane receptor (Secretin family) 316137 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 316138 7 transmembrane receptor (metabotropic glutamate family) 316142 ENV polyprotein (coat polyprotein) 316142 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 316142 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 316144 7 transmembrane receptor (metabotropic glutamate family) 316147 Ribosomal protein L15 316153 Sulfotransferase protein 316156 Sulfotransferase protein 316157 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 316158 Ribosomal protein S26e 316161 Ribosomal protein L31e 316164 CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein 316166 Homeobox domain 316168 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 316173 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 316202 MoaC family. Members of this family are involved in molybdenum cofactor biosynthesis. However their molecular function is not known 316208 Fork head domain 316210 cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of 316211 Ubiquitin carboxyl-terminal hydrolase family 2 316211 Ubiquitin carboxyl-terminal hydrolases family 2 316211 Zn-finger in ubiquitin-hydrolases and other protein 316212 LIM domain. This family represents two copies of the LIM structural domain 316213 PTB domain (IRS-1 type) 316214 Helix-loop-helix DNA-binding domain 316231 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 316231 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 316234 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 316235 impB/mucB/samB family. These proteins are involved in UV protection 316236 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 316241 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 316242 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 316249 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 316256 Death domain 316258 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 316259 7 transmembrane receptor (rhodopsin family) 316259 7 transmembrane receptor (Secretin family) 316259 7 transmembrane receptor (rhodopsin family) 316259 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 316262 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316265 Phosphoglycerate kinase 316266 SCP-like extracellular protein. This domain is also found in prokaryotes 316269 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 316270 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 316273 MCM2/3/5 family 316275 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 316276 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 316281 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 316281 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 316282 Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised 316283 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 316283 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 316284 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 316284 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 316285 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 316294 Thrombospondin type 1 domain 316294 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 316296 Thrombospondin N-terminal -like domain 316296 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 316299 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 316304 Glutamine synthetase, beta-Grasp domain 316304 Glutamine synthetase, catalytic domain 316310 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 316312 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 316312 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 316313 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 316316 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 316317 Protein-tyrosine phosphatase 316317 IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing 316318 Trypsin 316323 Ribosomal protein L21e 316327 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 316330 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 316338 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316341 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 316341 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 316341 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 316342 Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for 316343 MIT domain 316344 impB/mucB/samB family. These proteins are involved in UV protection 316344 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 316348 Phosducin 316354 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 316364 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 316369 SH2 domain 316369 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 316370 Filamin/ABP280 repeat 316372 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 316373 HMG (high mobility group) box 316376 Inositol monophosphatase family 316377 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 316378 HMG (high mobility group) box 316379 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 316379 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 316383 Ribosomal protein L14p/L23e 316385 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 316386 Ribosomal protein L21e 316390 WD domain, G-beta repeat 316393 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 316395 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 316395 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 316401 ribosomal L5P family C-terminus. This region is found associated with pfam00281 316403 tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included 316403 tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases 316408 Homeobox domain 316409 HSF-type DNA-binding 316419 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 316421 [2Fe-2S] binding domain 316421 CO dehydrogenase flavoprotein C-terminal domain 316421 FAD binding domain in molybdopterin dehydrogenase 316421 Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain 316421 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 316422 Ribosomal protein L21e 316423 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 316423 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316424 [2Fe-2S] binding domain 316424 CO dehydrogenase flavoprotein C-terminal domain 316424 FAD binding domain in molybdopterin dehydrogenase 316424 Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain 316424 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 316434 ATP synthase subunit C 316435 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 316443 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 316444 Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger 316444 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 316444 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 316446 Zinc carboxypeptidase 316452 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 316455 PH domain. PH stands for pleckstrin homology 316456 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 316461 Indole-3-glycerol phosphate synthase 316461 Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate 316461 Thiamine monophosphate synthase/TENI. Thiamine monophosphate synthase (TMP) (EC:2.5.1.3) catalyzes the substitution of the pyrophosphate of 2-methyl-4-amino-5- hydroxymethylpyrimidine pyrophosphate by 4-methyl-5- (beta-hydroxyethyl)thiazole phosphate to y 316467 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 316468 WD domain, G-beta repeat 316469 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316477 Microtubule associated protein (MAP65/ASE1 family) 316481 Acyl CoA binding protein 316482 LCCL domain 316482 SCP-like extracellular protein. This domain is also found in prokaryotes 316483 14-3-3 protein 316484 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 316495 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 316500 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 316503 Hsp90 protein 316504 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 316506 Fibronectin type III domain 316507 SH2 domain 316516 Uncharacterized protein family UPF0005 316518 Metallo-beta-lactamase superfamily 316519 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined b 316520 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 316521 Villin headpiece domain 316522 Ubiquitin carboxyl-terminal hydrolases family 2 316523 Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes 316525 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 316526 wnt family 316527 wnt family 316530 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict 316531 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 316531 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 316541 Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand 316543 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 316545 Clathrin adaptor complex small chain 316550 ADP-ribosylation factor family 316550 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 316556 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 316556 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 316557 Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa 316559 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316559 Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analogu 316577 F-box domain 316579 7 transmembrane receptor (rhodopsin family) 316580 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 316580 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 316580 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 316580 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 316583 Ribosomal protein L36e 316583 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 316587 Poly-adenylate binding protein, unique domain 316587 RNB-like protein. The function of this region of similarity is uncertain 316588 RNB-like protein. The function of this region of similarity is uncertain 316590 RNB-like protein. The function of this region of similarity is uncertain 316592 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 316595 GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein 316600 UDP-glucoronosyl and UDP-glucosyl transferase 316601 UDP-glucoronosyl and UDP-glucosyl transferase 316608 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 316610 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 316611 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 316611 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 316614 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 316615 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 316624 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 316624 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 316626 Helix-loop-helix DNA-binding domain 316628 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 316630 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 316632 Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes 316633 7 transmembrane receptor (rhodopsin family) 316633 Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs 316634 7 transmembrane receptor (rhodopsin family) 316636 Intermediate filament protein 316638 Fibronectin type III domain 316638 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 316639 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 316639 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 316640 Peptidase family C54 316641 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316645 CBF/Mak21 family 316646 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316651 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 316653 ENV polyprotein (coat polyprotein) 316656 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 316657 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 316658 NAC domain 316658 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 316660 RanBP1 domain 316663 F-box domain 316679 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 316680 PH domain. PH stands for pleckstrin homology 316685 Histidine acid phosphatase 316687 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 316689 C2 domain 316691 S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure 316700 Protein kinase domain 316700 Protein kinase C terminal domain 316708 Core histone H2A/H2B/H3/H4 316710 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 316711 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are repr 316715 UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid 316717 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 316718 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 316721 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 316722 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 316723 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 316725 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 316726 Cyclophilin type peptidyl-prolyl cis-trans isomerase 316727 Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73 316733 'Cold-shock' DNA-binding domain 316734 Ribosomal protein L44 316740 Fibronectin type III domain 316745 Ribosomal protein L21e 316748 Ribosomal protein S24e 316752 Citrate transporter 316762 Fibronectin type III domain 316763 Uncharacterized ACR, COG1579 316763 ADP-ribosylation factor family 316763 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 316765 Protein kinase domain 316768 Protein kinase domain 316768 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316769 Protein kinase domain 316770 Protein kinase domain 316773 Respiratory-chain NADH dehydrogenase 24 Kd subunit 316773 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 316774 Respiratory-chain NADH dehydrogenase 24 Kd subunit 316777 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 316781 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 316794 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 316798 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 316816 RNB-like protein. The function of this region of similarity is uncertain 316818 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 316821 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316822 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316823 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316824 Ribosomal protein L19e 316824 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316825 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316826 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 316843 Ribosomal protein L19e 316844 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316846 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316848 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316850 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316851 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316854 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316856 Ribosomal protein L19e 316858 7 transmembrane receptor (metabotropic glutamate family) 316860 Protein kinase domain 316862 Protein kinase domain 316864 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 316865 ENV polyprotein (coat polyprotein) 316865 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 316876 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 316877 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 316884 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 316884 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316885 Ribosomal protein L19e 316886 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316891 7 transmembrane receptor (rhodopsin family) 316892 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316893 Ribosomal protein L19e 316893 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316897 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 316899 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 316900 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 316900 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 316902 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 316914 ENV polyprotein (coat polyprotein) 316917 Ribosomal protein L35Ae 316925 Myosin head (motor domain) 316928 Glutaredoxin 316932 Glutaredoxin 316933 Glutaredoxin 316937 Glutaredoxin 316939 Glutaredoxin 316940 Glutaredoxin 316941 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 316943 Myosin head (motor domain) 316946 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 316949 Myosin head (motor domain) 316950 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 316951 Uncharacterized ACR, COG1579 316951 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 316953 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 316953 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 316954 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 316956 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316958 ENV polyprotein (coat polyprotein) 316959 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316962 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 316965 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 316970 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 316973 Myosin head (motor domain) 316976 Ribosomal RNA adenine dimethylase 316979 Ribosomal protein S5, C-terminal domain 316980 ENV polyprotein (coat polyprotein) 316984 Ribosomal protein S8e 316987 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 316988 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 316989 7 transmembrane receptor (rhodopsin family) 316990 7 transmembrane receptor (rhodopsin family) 316991 7 transmembrane receptor (rhodopsin family) 316996 Protein kinase domain 316996 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 316997 Myosin head (motor domain) 317001 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 317002 Myosin head (motor domain) 317002 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 317006 Protein kinase domain 317010 Myosin head (motor domain) 317012 Myosin head (motor domain) 317017 Myosin head (motor domain) 317019 Myosin head (motor domain) 317023 Myosin head (motor domain) 317027 Skp1 family, tetramerisation domain 317029 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 317030 Ribosomal protein L19e 317030 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 317031 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 317033 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 317034 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 317034 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 317037 Ribosomal protein L19e 317038 Ribosomal protein L19e 317040 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 317042 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 317043 Ribosomal protein L19e 317054 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 317062 ENV polyprotein (coat polyprotein) 317064 ADP-ribosylation factor family 317064 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 317067 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 317068 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 317071 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 317081 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 317085 Myosin head (motor domain) 317086 Myosin head (motor domain) 317088 Myosin head (motor domain) 317091 Myosin head (motor domain) 317093 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 317100 Myosin head (motor domain) 317102 Myosin head (motor domain) 317112 Myosin head (motor domain) 317114 Myosin head (motor domain) 317115 Myosin head (motor domain) 317117 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 317124 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 317127 Ribosomal protein L19e 317129 7 transmembrane receptor (rhodopsin family) 317131 Protein kinase domain 317137 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 317137 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 317140 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 317142 Myosin head (motor domain) 317146 Glutaredoxin 317147 ENV polyprotein (coat polyprotein) 317147 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 317149 Protein kinase domain 317150 Protein kinase domain 317152 Myosin head (motor domain) 317153 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 317155 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 317160 Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24 317161 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 317162 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 317163 Eukaryotic initiation factor 1A 317164 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 317164 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 317165 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 317165 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 317166 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 317166 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 317168 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 317168 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 317170 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 317178 TPR Domain 317178 jmjC domain 317181 Sushi domain (SCR repeat) 317181 HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unce 317182 Actin 317183 Cadherin domain 317187 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 317196 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 317196 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 317198 Beige/BEACH domain 317202 Cation-dependent mannose-6-phosphate receptor 317203 Protein kinase domain 317203 Protein kinase C terminal domain 317204 Cadherin domain 317205 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 317206 ENV polyprotein (coat polyprotein) 317207 Ribosomal protein S5, N-terminal domain 317208 Kinesin motor domain 317209 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 317211 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 317213 Cyclophilin type peptidyl-prolyl cis-trans isomerase 317213 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 317214 Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-phos 317216 Eukaryotic protein of unknown function, DUF279 317217 Core histone H2A/H2B/H3/H4 317218 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 317219 7 transmembrane receptor (rhodopsin family) 317222 Ribosomal protein L24e 317223 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 317224 Fibronectin type III domain 317226 Fibronectin type III domain 317227 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 317231 Ribosomal L10 317232 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317233 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 317233 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 317235 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 317236 Calsequestrin 317236 Nebulin repeat 317241 Zinc finger, C3HC4 type (RING finger) 317242 Ribosomal L38e protein family 317244 Ribosomal protein L34e 317244 Ribosomal protein S5, N-terminal domain 317247 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 317250 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317252 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 317252 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 317252 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 317253 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 317256 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 317257 ENV polyprotein (coat polyprotein) 317257 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 317259 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317264 Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain 317264 Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain 317268 Homeobox domain 317270 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317271 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317273 ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus 317274 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317275 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 317276 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 317279 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 317284 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 317285 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 317290 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 317293 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 317294 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 317296 pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins 317297 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317298 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 317298 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 317299 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 317300 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 317301 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 317302 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 317304 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 317306 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 317308 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 317309 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317312 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 317313 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 317314 HMG (high mobility group) box 317316 Zona pellucida-like domain 317318 HMG (high mobility group) box 317321 HMG (high mobility group) box 317322 Glypican 317323 HMG (high mobility group) box 317324 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317326 HMG (high mobility group) box 317332 Ribosomal L29e protein family 317332 Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation 317335 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 317335 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 317336 Ribosomal protein S19e 317337 Ribosomal protein S19e 317339 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 317339 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 317341 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 317346 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 317352 Core histone H2A/H2B/H3/H4 317353 Core histone H2A/H2B/H3/H4 317356 Cadherin domain 317358 Core histone H2A/H2B/H3/H4 317358 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 317359 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 317361 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 317362 Zn-finger in ubiquitin-hydrolases and other protein 317362 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 317363 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1 317364 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 317365 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 317366 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 317368 MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue 317369 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317370 WD domain, G-beta repeat 317371 P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB) 317371 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl 317373 Ribosomal protein S19e 317374 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1 317375 Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc 317376 Helix-loop-helix DNA-binding domain 317377 Ribosomal L10 317379 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 317380 LIM domain. This family represents two copies of the LIM structural domain 317382 Fork head domain 317383 Ribosomal L10 317383 Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p 317385 Zn-finger in Ran binding protein and others 317385 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317389 Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin 317389 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 317393 NUDIX domain 317394 Polyprenyl synthetase 317395 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 317396 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 317396 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 317398 Ubiquitin carboxyl-terminal hydrolase family 2 317398 Ubiquitin carboxyl-terminal hydrolases family 2 317399 Ubiquitin carboxyl-terminal hydrolase family 2 317399 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 317399 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 317401 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 317414 Insulinase (Peptidase family M16) 317415 MCM2/3/5 family 317416 von Willebrand factor type A domain 317417 GTPase of unknown function 317417 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 317417 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 317417 Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i 317417 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317419 Ribosomal protein L21e 317420 Protein kinase domain 317422 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 317425 jmjC domain 317425 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 317426 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 317427 Protein kinase domain 317428 HMG (high mobility group) box 317433 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 317439 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 317440 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 317444 Cytochrome c/c1 heme lyase 317446 Glutaredoxin 317449 EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m 317450 EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m 317451 EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m 317454 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 317455 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 317461 Cytochrome c/c1 heme lyase 317468 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 317470 MAM domain. An extracellular domain found in many receptors 317476 Macrophage migration inhibitory factor (MIF) 317476 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317479 CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14 317479 MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins 317481 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 317482 Uncharacterized BCR, YhhW family COG1741 317488 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 317488 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 317489 Glutamine amidotransferase class-I 317491 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 317496 Protein kinase domain 317497 Ribosomal protein L19e 317498 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 317501 Ribosomal protein L13e 317506 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 317506 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 317509 WD domain, G-beta repeat 317510 Actin 317515 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 317517 Ribosomal protein L34e 317517 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 317520 Clathrin light chain 317521 Protein kinase domain 317521 Protein kinase C terminal domain 317525 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 317525 Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy 317533 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317534 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 317537 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 317538 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 317541 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 317544 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 317545 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 317550 ADP-ribosylation factor family 317550 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 317550 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 317552 ENV polyprotein (coat polyprotein) 317553 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 317553 TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance 317554 Proteasome A-type and B-type 317559 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 317561 Josephin 317566 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 317566 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 317566 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 317569 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 317569 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 317571 HMG (high mobility group) box 317572 HMG (high mobility group) box 317573 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 317573 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 317574 HMG (high mobility group) box 317575 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 317575 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 317575 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 317576 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 317576 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 317580 ADP-ribosylation factor family 317580 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 317581 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317583 HMG (high mobility group) box 317584 HMG (high mobility group) box 317585 HMG (high mobility group) box 317588 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 317589 Protein kinase domain 317590 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317593 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 317597 HMG (high mobility group) box 317598 PH domain. PH stands for pleckstrin homology 317599 E1-E2 ATPase 317601 HMG (high mobility group) box 317602 HMG (high mobility group) box 317604 HMG (high mobility group) box 317605 HMG (high mobility group) box 317606 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 317607 7 transmembrane receptor (rhodopsin family) 317608 7 transmembrane receptor (rhodopsin family) 317608 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 317609 HMG (high mobility group) box 317612 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317613 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 317615 Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5 317615 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 317618 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 317625 Polyprenyl synthetase 317626 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length 317627 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 317627 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 317628 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 317631 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 317631 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 317646 Ribosomal protein L13 317649 Eukaryotic initiation factor 4E 317674 Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar 317675 Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein 317677 CUB domain 317677 Sushi domain (SCR repeat) 317701 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 317702 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 317703 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 317705 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 317717 Synaptobrevin 317719 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of 317750 Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell 319100 7 transmembrane receptor (rhodopsin family) 319101 Intermediate filament protein 319104 7 transmembrane receptor (rhodopsin family) 319105 7 transmembrane receptor (rhodopsin family) 319106 7 transmembrane receptor (rhodopsin family) 319107 7 transmembrane receptor (rhodopsin family) 319146 Interferon alpha/beta domain 319164 Core histone H2A/H2B/H3/H4 319166 Core histone H2A/H2B/H3/H4 319179 Core histone H2A/H2B/H3/H4 319179 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 319189 Core histone H2A/H2B/H3/H4 319189 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 319195 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 319196 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 319196 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 319197 7 transmembrane receptor (rhodopsin family) 319197 7 transmembrane receptor (rhodopsin family) 319200 7 transmembrane receptor (rhodopsin family) 319210 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 319214 7 transmembrane receptor (rhodopsin family) 319214 7 transmembrane receptor (rhodopsin family) 319217 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 319239 7 transmembrane receptor (rhodopsin family) 319239 7 transmembrane receptor (rhodopsin family) 319249 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 319259 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 319262 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 319281 L1 transposable element 319281 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 319281 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 319322 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 319322 PSP. Proline rich domain found in numerous spliceosome associated proteins 319322 Domain of unknown function (DUF382). This domain is specific to the human splicing factor 3b subunit 2 and it's orthologs 319322 SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins 319358 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 319387 Olfactomedin-like domain 319387 Galactose binding lectin domain 319387 7 transmembrane receptor (Secretin family) 319387 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 319387 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 319387 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein coup 319430 7 transmembrane receptor (rhodopsin family) 319430 7 transmembrane receptor (rhodopsin family) 319433 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 319446 Renal dipeptidase 319446 Renal dipeptidase 319448 Fibronectin type III domain 319468 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 319480 von Willebrand factor type A domain 319480 von Willebrand factor type A domain 319480 FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs 319481 WD domain, G-beta repeat 319482 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 319482 von Willebrand factor type D domain 319482 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 319489 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 319504 Fibronectin type III domain 319504 Fibronectin type III domain 319504 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 319508 C2 domain 319508 C2 domain 319508 C2 domain 319518 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 319518 Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in 319520 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 319520 Protein-tyrosine phosphatase 319520 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 319520 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 319535 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 319545 ENV polyprotein (coat polyprotein) 319554 NUDIX domain 319555 WD domain, G-beta repeat 319573 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 319583 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not 319583 ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653 319583 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 319586 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 319601 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 319606 Importin-beta N-terminal domain 319615 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 319625 Aldose 1-epimerase 319634 GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases 319642 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 319642 ADP-ribosylation factor family 319642 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 319642 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 319653 Mitochondrial carrier protein 319653 Mitochondrial carrier protein 319660 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M 319679 L1 transposable element 319679 Actin interacting protein 3 319679 dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate 319679 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 319679 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 319679 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 319679 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 319701 F-box domain 319710 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 319734 Cache domain 319740 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 319765 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 319765 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 319790 WD domain, G-beta repeat 319790 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes. 319800 Sugar (and other) transporter 319800 Sugar (and other) transporter 319801 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 319817 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 319836 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 319848 LacY proton/sugar symporter. This family is closely related to the sugar transporter family 319875 Trypsin 319888 Acyltransferase family. This family includes a range of acyltransferase enzymes. This domain is found in many as yet uncharacterised C. elegans proteins and it is approximately 300 amino acids long 319888 Acyltransferase family. This family includes a range of acyltransferase enzymes. This domain is found in many as yet uncharacterised C. elegans proteins and it is approximately 300 amino acids long 319922 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 319922 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 319924 Phosphotyrosine interaction domain (PTB/PID) 319924 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 319924 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 319933 L1 transposable element 319934 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 319934 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 319944 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 319945 Phosphoadenosine phosphosulfate reductase family. This domain is found in phosphoadenosine phosphosulfate (PAPS) reductase enzymes or PAPS sulfotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N ty 319953 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 319953 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 319965 C2 domain 319991 Kinesin motor domain 319991 Kinesin motor domain 319997 Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342 320024 Carboxylesterase 320040 Zinc finger, C3HC4 type (RING finger) 320043 7 transmembrane receptor (rhodopsin family) 320067 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 320078 Olfactomedin-like domain 320078 Olfactomedin-like domain 320100 GrpE 320106 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr 320106 Tryptophan/tyrosine permease family 320106 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 320109 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 320119 MIT domain 320119 Protein kinase domain 320127 Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown 320129 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 320129 Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits 320139 Protein-tyrosine phosphatase 320157 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 320182 Helix-loop-helix DNA-binding domain 320182 Helix-loop-helix DNA-binding domain 320183 SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad 320191 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 320202 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 320204 Conserved hypothetical protein 95 320204 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 320204 N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The e 320213 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 320244 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 320244 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 320267 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 320267 KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 320292 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 320302 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain conta 320302 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 320332 Core histone H2A/H2B/H3/H4 320332 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 320338 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 320355 Lipase 320376 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 320404 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 320405 PH domain. PH stands for pleckstrin homology 320407 Lectin C-type domain. This family includes both long and short form C-type 320435 Vacuolar sorting protein 9 (VPS9) domain 320438 ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum 320438 ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum m 320452 2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2 320454 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 320454 Trypsin 320454 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 320484 GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position 320502 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 320534 Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra 320560 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 320560 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 320560 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 320560 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si 320560 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 320560 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 320560 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 320560 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 320563 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 320571 E1-E2 ATPase 320590 Sugar (and other) transporter 320595 jmjC domain 320595 jmjC domain 320595 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 320615 Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in 320615 Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in 320634 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 320634 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 320635 Oxidoreductase FAD-binding domain 320635 Oxidoreductase FAD-binding domain 320635 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 320651 Ubiquitin carboxyl-terminal hydrolase family 2 320651 Ubiquitin carboxyl-terminal hydrolases family 2 320655 Per1-like. A member of this family has been implemented in protein processing in the endoplasmic reticulum 320664 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 320676 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 320678 Intermediate filament protein 320678 Intermediate filament protein 320679 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a 320683 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 320685 Cytidine and deoxycytidylate deaminase zinc-binding region 320712 Fibronectin type III domain 320712 Fibronectin type III domain 320713 Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family 320718 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 320718 Sulfate transporter family. Mutations may lead to several human diseases 320718 STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP 320727 Importin-beta N-terminal domain 320727 CAS/CSE protein, C-terminus. Mammalian cellular apoptosis susceptibility (CAS) proteins are homologous to the yeast chromosome-segregation protein, CSE1. This family aligns the C-terminal halves (approximately). CAS is involved in both cellular apoptosis 320734 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 320734 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 320745 Ubiquitin carboxyl-terminal hydrolases family 2 320747 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 320747 Leucine Rich Repeat. CAUTION: This Pfam may not find all Leucine Rich Repeats in a protein. Leucine Rich Repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in 320769 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 320795 Hr1 repeat 320799 Homeobox domain 320808 WD domain, G-beta repeat 320825 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 320858 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 320864 Intermediate filament protein 320873 Cadherin domain 320878 LIM domain. This family represents two copies of the LIM structural domain 320878 FAD binding domain. This domain is involved in FAD binding in a number of enzymes 320878 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 320878 Thi4 family. This family includes a putative thiamine biosynthetic enzyme 320878 LIM domain. This family represents two copies of the LIM structural domain 320878 FAD binding domain. This domain is involved in FAD binding in a number of enzymes 320878 Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain 320878 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 320898 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 320898 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 320909 Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events 320910 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 320910 Integrins, beta chain. Sequences cut off at repeats due to overlap with EGF 320910 Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF 320940 E1-E2 ATPase 320941 L1 transposable element 320951 Phosphatidylserine decarboxylase. This is a family of phosphatidylserine decarboxylases, EC:4.1.1.65. These enzymes catalyse the reaction: Phosphatidyl-L-serine <=> phosphatidylethanolamine + CO2. Phosphatidylserine decarboxylase plays a central role in t 320974 Fibronectin type III domain 320981 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 320981 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 320982 ADP-ribosylation factor family 320995 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 320997 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 320997 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 321003 metallopeptidase family M24 321018 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 321019 7 transmembrane receptor (rhodopsin family) 321020 7 transmembrane receptor (rhodopsin family) 321021 7 transmembrane receptor (rhodopsin family) 326305 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 326624 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 327655 Histidine acid phosphatase 327657 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 327729 Ribosomal protein L11, RNA binding domain 327729 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 327731 7 transmembrane receptor (rhodopsin family) 327734 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 327736 Ribosomal protein L11, RNA binding domain 327736 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 327737 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 327741 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 327754 ENV polyprotein (coat polyprotein) 327761 Tetraspanin family 327762 SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins 327762 UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t 327767 Proteasome A-type and B-type 327775 L1 transposable element 327793 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 327794 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 327799 Zn-finger in ubiquitin-hydrolases and other protein 327803 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 327803 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 327806 ENV polyprotein (coat polyprotein) 327807 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 327807 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 327807 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 327810 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 327814 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 327815 Sulfotransferase protein 327816 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 327816 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 327819 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n 327819 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 327826 PTB domain (IRS-1 type) 327826 PTB domain (IRS-1 type) 327827 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 327827 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 327827 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 327827 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 327828 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 327830 Protein kinase domain 327836 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 327840 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 327842 Gelsolin repeat 327842 Villin headpiece domain 327843 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 327843 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 327844 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 327853 7 transmembrane receptor (rhodopsin family) 327855 7 transmembrane receptor (rhodopsin family) 327856 7 transmembrane receptor (rhodopsin family) 327859 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 327864 Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain i 327868 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 327870 Ribosomal L39 protein 327871 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 327871 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 327873 VHS domain. Domain present in VPS-27, Hrs and STAM 327873 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 327878 Ribosomal protein S5, C-terminal domain 327880 Ribosomal protein L11, RNA binding domain 327883 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 327884 Ribosomal L15 327884 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 327886 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 327889 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 327889 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 327890 ENV polyprotein (coat polyprotein) 327891 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 327891 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 327892 ENV polyprotein (coat polyprotein) 327900 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 327907 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 327907 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 327907 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 327919 7 transmembrane receptor (rhodopsin family) 327920 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 327920 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 327923 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 327923 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 327923 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 327923 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 327925 Ribosomal protein S28e 327929 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 327930 RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases 327941 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 327942 Uncharacterized LmbE-like protein, COG2120 327944 HMG (high mobility group) box 327951 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 327956 Vitelline membrane outer layer protein I (VOMI). VOMI binds tightly to ovomucin fibrils of the egg yolk membrane. The structure that consists of three beta-sheets forming Greek key motifs, which are related by an internal pseudo three-fold symmetry. Furth 327963 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 327965 7 transmembrane receptor (rhodopsin family) 327972 Ribosomal protein S2 327979 Ribosomal protein L11, RNA binding domain 327979 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 327981 WAP-type (Whey Acidic Protein) 'four-disulfide core' 327983 Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosyn 327991 Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase 327995 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 328022 Protein kinase domain 328026 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328026 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328026 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328035 Fatty acid desaturase 328042 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328050 HMG (high mobility group) box 328054 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 328054 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 328059 Amino acid permease 328062 Phosphoglycerate kinase 328063 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 328076 Leucine Rich Repeat. CAUTION: This Pfam may not find all Leucine Rich Repeats in a protein. Leucine Rich Repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in 328076 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 328076 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 328077 Ribosomal protein L36e 328079 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 328080 Ribosomal protein S5, N-terminal domain 328081 Ribosomal protein L24e 328083 Phosphatidylethanolamine-binding protein 328089 Ribosomal protein S26e 328090 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328092 Uncharacterized ACR, COG1490 328095 Ribosomal protein S26e 328099 Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph 328102 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 328104 Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA 328106 Fibronectin type III domain 328106 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 328106 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 328107 FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1 328107 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 328107 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 328115 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328115 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328115 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328115 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328116 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 328119 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 328123 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328133 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 328140 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 328141 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328143 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 328146 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 328148 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 328149 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 328158 Macrophage migration inhibitory factor (MIF) 328167 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 328169 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328169 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328173 Protein kinase domain 328174 SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily 328174 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 328176 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 328177 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328178 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 328182 Ribosomal protein S5, N-terminal domain 328188 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 328196 7 transmembrane receptor (rhodopsin family) 328197 Ribosomal L29e protein family 328198 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 328200 7 transmembrane receptor (rhodopsin family) 328201 Core histone H2A/H2B/H3/H4 328201 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 328204 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 328205 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328205 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328205 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328205 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328208 Core histone H2A/H2B/H3/H4 328208 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 328211 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328226 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328230 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 328237 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 328247 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 328248 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 328251 Core histone H2A/H2B/H3/H4 328258 Mitochondrial carrier protein 328258 Mitochondrial carrier protein 328266 ENV polyprotein (coat polyprotein) 328268 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 328269 Low molecular weight phosphotyrosine protein phosphatase 328274 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 328279 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 328280 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 328282 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero 328292 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 328297 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328297 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328297 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328302 Ribosomal protein S11 328306 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328306 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328311 NAC domain 328317 Ribosomal S17 328323 Protein kinase domain 328329 Protein kinase domain 328329 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 328340 Protein of unknown function (DUF229). Members of this family are uncharacterised. They are 500-1200 amino acids in length and share a long region conservation that probably corresponds to several domains 328342 Ribosomal protein L14p/L23e 328343 Ribosomal protein L34e 328344 Ribosomal protein S5, N-terminal domain 328351 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 328355 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328361 L1 transposable element 328374 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328374 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328381 SH2 domain 328386 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328386 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328386 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328395 7 transmembrane receptor (rhodopsin family) 328396 7 transmembrane receptor (rhodopsin family) 328397 7 transmembrane receptor (rhodopsin family) 328401 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 328403 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328403 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328404 ENV polyprotein (coat polyprotein) 328410 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 328413 Uncharacterized ACR, COG1579 328413 HlyD family secretion protein 328413 Myosin head (motor domain) 328413 Intermediate filament protein 328413 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 328413 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 328413 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 328413 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 328415 IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th 328417 BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b 328417 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 328418 ATP synthase subunit C 328423 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328423 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328423 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328424 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 328432 CAP-Gly domain. CAP stands for cytoskeleton-associated proteins 328440 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 328440 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 328451 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 328457 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328459 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 328464 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran 328467 Low molecular weight phosphotyrosine protein phosphatase 328478 7 transmembrane receptor (rhodopsin family) 328481 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 328482 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 328486 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 328491 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 328491 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 328496 UDP-glucoronosyl and UDP-glucosyl transferase 328500 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 328504 Actin 328508 Ribosomal protein L13 328518 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328519 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 328523 ENV polyprotein (coat polyprotein) 328523 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 328528 Ribonucleotide reductase, small chain 328529 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 328532 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 328533 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 328537 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 328539 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 328548 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 328549 HMG (high mobility group) box 328551 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 328572 KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun 328572 TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1 328572 KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun 328580 Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin 328585 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328586 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 328595 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328595 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328595 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328595 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328599 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 328600 7 transmembrane receptor (rhodopsin family) 328616 7 transmembrane receptor (rhodopsin family) 328617 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 328621 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328625 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 328635 Ribosomal L29e protein family 328640 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 328648 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328648 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328648 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328672 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 328672 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 328689 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328689 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328689 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328689 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328691 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 328695 Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1 328695 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 328696 7 transmembrane receptor (rhodopsin family) 328697 7 transmembrane receptor (rhodopsin family) 328698 7 transmembrane receptor (rhodopsin family) 328699 pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel 328699 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 328701 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 328703 Actin 328707 Eukaryotic initiation factor 4E 328714 Ribosomal protein L19e 328715 Protein kinase domain 328730 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 328731 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 328743 Ribosomal protein S18 328749 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 328751 Enolase, N-terminal domain 328751 Enolase, C-terminal TIM barrel domain 328754 Proteasome A-type and B-type 328755 Polyprenyl synthetase 328758 7 transmembrane receptor (metabotropic glutamate family) 328759 7 transmembrane receptor (metabotropic glutamate family) 328759 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 328760 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328761 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 328764 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328778 ADP-ribosylation factor family 328778 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 328778 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 328779 Sulfotransferase protein 328780 Trypsin 328781 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 328781 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 328782 Trypsin 328790 Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of 328792 7 transmembrane receptor (rhodopsin family) 328795 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 328795 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 328803 Class II histocompatibility antigen, beta domain 328804 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 328805 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 328807 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 328809 Class I Histocompatibility antigen, domains alpha 1 and 2 328810 Ribosomal protein L6 328812 7 transmembrane receptor (rhodopsin family) 328814 Mitochondrial carrier protein 328815 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 328826 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 328831 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 328832 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328832 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328842 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328842 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328842 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328844 ENV polyprotein (coat polyprotein) 328846 Mitochondrial carrier protein 328847 Mitochondrial carrier protein 328849 Ribosomal protein L34e 328851 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 328854 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 328854 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 328860 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 328874 F-box domain 328877 WD domain, G-beta repeat 328878 Ribosomal L28e protein family 328879 Ribosomal protein L36e 328880 ENV polyprotein (coat polyprotein) 328880 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 328881 UDP-glucoronosyl and UDP-glucosyl transferase 328882 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 328883 UDP-glucoronosyl and UDP-glucosyl transferase 328884 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 328889 ENV polyprotein (coat polyprotein) 328890 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 328895 PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi 328895 TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s 328897 GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do 328898 L1 transposable element 328899 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328899 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 328899 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328899 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 328899 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328901 Thrombospondin N-terminal -like domain 328902 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 328908 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 328909 Cadherin domain 328909 Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic 328911 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 328911 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 328911 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 328918 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 328927 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 328927 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 328940 Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal 328948 HMG (high mobility group) box 328954 Ribosomal protein L10 328959 Ribosomal L29e protein family 328964 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 328967 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 328967 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 328967 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 328971 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 328973 WD domain, G-beta repeat 328975 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 328992 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 329000 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 329008 Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13, 329020 Ribosomal protein S7e 329023 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 329027 Uncharacterized ACR, YagE family COG1723 329028 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 329029 7 transmembrane receptor (rhodopsin family) 329030 7 transmembrane receptor (rhodopsin family) 329031 ENV polyprotein (coat polyprotein) 329032 Ribosomal protein S2 329037 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 329048 Ribosomal S17 329052 HMG (high mobility group) box 329055 ab-hydrolase associated lipase region 329055 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 329064 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 329064 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 329076 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 329080 Ribosomal L29e protein family 329081 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 329087 14-3-3 protein 329088 Ribosomal protein S15 329089 ENV polyprotein (coat polyprotein) 329089 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 329090 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329090 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329090 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329093 Zinc carboxypeptidase 329094 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329106 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 329111 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 329112 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 329112 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329113 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329116 IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing 329126 Ribosomal protein L13e 329133 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 329136 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 329136 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 329139 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 329141 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 329141 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329143 Mitochondrial carrier protein 329144 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 329146 Protein kinase domain 329147 ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p 329149 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 329154 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 329161 Ribosomal protein L6e 329162 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329169 Cytochrome c oxidase subunit Vb 329179 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329179 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329179 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329179 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329194 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 329199 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 329216 Ribosomal L18ae protein family 329218 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 329219 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329219 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329220 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329221 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329221 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329221 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329222 F-actin capping protein alpha subunit 329226 Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function 329232 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 329234 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 329234 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 329236 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 329244 Interleukin 10 329248 Phosphatidylinositol 3- and 4-kinase 329249 C2 domain 329251 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 329259 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329259 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329260 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 329260 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 329262 ENV polyprotein (coat polyprotein) 329263 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329263 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329278 Fibronectin type III domain 329278 Fibronectin type III domain 329278 Fibrinogen beta and gamma chains, C-terminal globular domain 329281 GDP dissociation inhibitor 329289 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329289 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329289 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329289 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329293 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 329296 Ribosomal protein L6e 329299 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329299 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329302 Protein kinase domain 329303 CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations 329310 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329319 MYND finger 329320 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 329320 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329322 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 329324 C2 domain 329326 Ribosomal protein S5, C-terminal domain 329327 7 transmembrane receptor (metabotropic glutamate family) 329328 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329328 Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined b 329331 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329331 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329331 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329331 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329334 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329334 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329336 mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter 329337 ATP synthase 329339 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 329341 Nucleoside diphosphate kinase 329342 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 329344 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 329344 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329355 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329357 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 329357 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 329362 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 329362 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 329363 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329364 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329364 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329378 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 329383 Ribosomal protein S2 329384 Peptidyl-tRNA hydrolase 329389 Ribosomal protein L36e 329390 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 329396 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 329398 Ribosomal L28e protein family 329399 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 329399 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329400 ENV polyprotein (coat polyprotein) 329403 Ribosomal S3Ae family 329416 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 329417 Low-density lipoprotein receptor domain class A 329419 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 329421 Myosin head (motor domain) 329424 Adenylate kinase 329432 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 329435 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 329440 Enolase, N-terminal domain 329440 Enolase, C-terminal TIM barrel domain 329440 Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain 329441 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329443 L1 transposable element 329443 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 329445 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 329445 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 329447 7 transmembrane receptor (rhodopsin family) 329448 7 transmembrane receptor (rhodopsin family) 329450 7 transmembrane receptor (rhodopsin family) 329454 7 transmembrane receptor (rhodopsin family) 329457 7 transmembrane receptor (rhodopsin family) 329458 7 transmembrane receptor (rhodopsin family) 329459 7 transmembrane receptor (rhodopsin family) 329460 7 transmembrane receptor (rhodopsin family) 329461 7 transmembrane receptor (rhodopsin family) 329462 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 329463 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329470 Aminotransferase class I and II 329472 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 329473 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 329474 Ribosomal L15 329477 Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits 329482 Doublecortin 329482 Doublecortin 329485 Ribosomal protein L13e 329486 Cyclophilin type peptidyl-prolyl cis-trans isomerase 329489 7 transmembrane receptor (rhodopsin family) 329493 HMG (high mobility group) box 329502 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lyso 329502 Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop 329504 Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence 329509 Protein kinase domain 329512 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329516 Protein kinase domain 329520 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 329521 Eukaryotic ribosomal protein L18 329522 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329522 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329522 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329522 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329524 Thrombospondin type 1 domain 329524 AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges 329528 Spumavirus gag protein 329532 Hsp90 protein 329533 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 329539 Endomembrane protein 70 329547 LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 329547 pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar 329553 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329553 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329553 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329553 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329555 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 329555 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329560 Cyclophilin type peptidyl-prolyl cis-trans isomerase 329567 Helicase. This family consists of Helicases from the Herpes viruses. Helicases are responsible for the unwinding of DNA and are essential for replication and completion of the viral life cycle 329573 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 329574 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 329579 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 329581 Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat 329585 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329586 Ribosomal protein L15 329589 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329589 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329592 ENV polyprotein (coat polyprotein) 329592 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 329598 ENV polyprotein (coat polyprotein) 329602 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 329603 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329635 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 329652 Carboxylesterase 329653 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 329653 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329655 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 329656 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329656 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329660 7 transmembrane receptor (metabotropic glutamate family) 329660 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 329662 ENV polyprotein (coat polyprotein) 329662 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329662 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 329662 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 329666 7 transmembrane receptor (rhodopsin family) 329667 Zinc finger, C3HC4 type (RING finger) 329669 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 329671 Class I Histocompatibility antigen, domains alpha 1 and 2 329674 Intermediate filament protein 329675 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 329675 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329681 Ribosomal protein S5, C-terminal domain 329686 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329686 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329692 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 329693 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 329694 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 329714 Core histone H2A/H2B/H3/H4 329715 Core histone H2A/H2B/H3/H4 329720 HMG (high mobility group) box 329723 Translationally controlled tumor protein 329725 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran 329727 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 329727 uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain 329727 DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways 329732 Glycosyl hydrolases family 18 329736 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329736 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329736 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329736 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329749 Ribosomal protein S6e 329757 Ribosomal protein L11, RNA binding domain 329757 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 329761 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329761 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329764 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329777 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this 329777 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this 329777 GHMP kinases putative ATP-binding protein 329777 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f 329778 Eukaryotic porin 329779 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 329783 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329786 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 329786 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 329787 ENV polyprotein (coat polyprotein) 329787 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329787 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 329787 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 329787 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 329788 L1 transposable element 329788 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 329790 ENV polyprotein (coat polyprotein) 329791 Ribosomal protein S12 329792 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 329799 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329801 Oxidoreductase FAD-binding domain 329801 Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity 329804 Ribosomal protein S24e 329808 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329808 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329810 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 329818 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329821 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329823 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329823 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329823 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329824 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 329828 Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases 329841 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 329846 Translationally controlled tumor protein 329855 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 329855 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329857 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329857 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329863 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 329864 HMG (high mobility group) box 329865 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329867 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329867 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329867 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329869 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 329874 Ribosomal protein L34e 329879 Interferon alpha/beta domain 329881 Interferon alpha/beta domain 329882 Interferon alpha/beta domain 329885 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329889 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 329889 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 329890 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329893 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 329894 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 329920 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329920 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329921 ENV polyprotein (coat polyprotein) 329921 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 329921 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329921 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 329924 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 329925 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 329926 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 329927 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 329928 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 329931 CAP protein 329932 Core histone H2A/H2B/H3/H4 329934 Fork head domain 329936 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 329937 Cyclin-dependent kinase regulatory subunit 329941 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 329948 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 329948 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 329948 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 329948 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 329951 C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family 329953 Ribosomal protein S8 329954 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 329954 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 329957 Ribosomal protein S27 329959 Transcription factor S-II (TFIIS) 329962 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 329963 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 329971 Ribosomal protein S5, N-terminal domain 329971 Ribosomal protein S5, C-terminal domain 329972 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 329976 Mitochondrial carrier protein 329977 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 329977 FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif 329983 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 329987 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 329987 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 329991 HMG (high mobility group) box 329993 Carboxylesterase 329994 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 329996 HMG (high mobility group) box 330011 Agrin NtA domain. Agrin is a multidomain heparan sulphate proteoglycan, that is a key organizer for the induction of postsynaptic specializations at the neuromuscular junction. Binding of agrin to basement membranes requires the amino terminal (NtA) domai 330012 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 330014 Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue 330017 Palmitoyl protein thioesterase 330018 Palmitoyl protein thioesterase 330018 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330020 HMG (high mobility group) box 330021 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 330022 ENV polyprotein (coat polyprotein) 330027 Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym 330028 Ribosomal protein S27 330032 Ribosomal protein L31e 330034 ENV polyprotein (coat polyprotein) 330034 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 330034 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 330037 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330037 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330037 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330042 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 330046 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 330051 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 330057 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330057 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330062 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 330064 Sodium:solute symporter family 330067 Zinc finger, C3HC4 type (RING finger) 330079 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330079 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330089 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330093 ADP-ribosylation factor family 330093 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 330094 Ribosomal protein L13e 330119 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 330119 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 330122 Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds 330124 Ribosomal protein L13e 330125 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330125 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330125 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330129 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 330129 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 330130 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330130 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330136 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330136 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330136 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330149 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 330155 Ribosomal protein L34e 330158 7 transmembrane receptor (metabotropic glutamate family) 330159 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 330160 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 330161 SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation 330167 Protein kinase domain 330167 PH domain. PH stands for pleckstrin homology 330171 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly 330193 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 330200 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 330206 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330206 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330206 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330206 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330207 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 330213 WD domain, G-beta repeat 330216 Metallo-beta-lactamase superfamily 330216 Metallo-beta-lactamase superfamily 330222 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 330223 Fibronectin type III domain 330230 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 330236 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330242 DNA directed RNA polymerase, 7 kDa subunit 330250 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330251 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330251 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330251 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330257 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 330258 L1 transposable element 330259 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 330260 Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine 330260 Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis 330260 Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H 330264 Actin 330268 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 330269 Ribosomal protein L19e 330272 HMG (high mobility group) box 330273 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 330275 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330287 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 330294 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 330301 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 330306 Cadherin domain 330308 Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge 330309 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 330326 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 330327 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 330328 ENV polyprotein (coat polyprotein) 330328 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 330330 ENV polyprotein (coat polyprotein) 330333 ENV polyprotein (coat polyprotein) 330333 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 330334 ENV polyprotein (coat polyprotein) 330340 Ribosomal protein S7e 330344 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330344 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330344 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330344 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330348 Ribosomal protein L34e 330356 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 330357 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 330357 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330358 Cyclophilin type peptidyl-prolyl cis-trans isomerase 330359 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330359 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330364 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330366 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 330370 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330370 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330380 Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family 330387 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330388 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 330396 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 330396 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 330406 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 330411 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330412 ENV polyprotein (coat polyprotein) 330412 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 330412 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 330423 Ribosomal protein S5, C-terminal domain 330423 Ribosomal protein S5, N-terminal domain 330424 HMG (high mobility group) box 330429 Lectin C-type domain. This family includes both long and short form C-type 330430 Protein kinase domain 330434 Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family 330436 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 330440 Sulfotransferase protein 330441 HMG (high mobility group) box 330443 S25 ribosomal protein 330444 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 330444 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 330445 MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function 330445 RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3 330449 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330449 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330449 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330450 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 330450 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 330453 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 330454 7 transmembrane receptor (rhodopsin family) 330454 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 330455 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 330456 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 330457 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 330459 Mo25 protein family 330463 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 330463 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 330465 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330465 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330465 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330465 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330466 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 330467 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 330470 Fibronectin type II domain 330471 Fibronectin type II domain 330474 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 330475 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 330476 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 330477 eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors 330478 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330478 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330488 Zinc finger, C3HC4 type (RING finger) 330492 ENV polyprotein (coat polyprotein) 330495 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330505 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 330506 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 330507 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 330509 Ribosomal family S4e 330511 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 330515 PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline 330517 ENV polyprotein (coat polyprotein) 330517 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 330519 Trypsin 330525 Eukaryotic ribosomal protein L18 330527 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 330531 7 transmembrane receptor (rhodopsin family) 330532 ENV polyprotein (coat polyprotein) 330532 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 330532 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 330534 7 transmembrane receptor (rhodopsin family) 330535 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330548 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 330558 Tetraspanin family 330560 Actin 330565 NUDIX domain 330568 Ribosomal protein S28e 330570 Ribosomal protein L31e 330574 ENV polyprotein (coat polyprotein) 330581 7 transmembrane receptor (metabotropic glutamate family) 330581 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 330582 7 transmembrane receptor (metabotropic glutamate family) 330583 7 transmembrane receptor (metabotropic glutamate family) 330584 7 transmembrane receptor (metabotropic glutamate family) 330585 7 transmembrane receptor (rhodopsin family) 330586 7 transmembrane receptor (rhodopsin family) 330587 7 transmembrane receptor (rhodopsin family) 330588 7 transmembrane receptor (rhodopsin family) 330589 7 transmembrane receptor (rhodopsin family) 330590 7 transmembrane receptor (metabotropic glutamate family) 330591 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330605 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 330609 Conserved nucleoporin domain 330613 7 transmembrane receptor (rhodopsin family) 330614 7 transmembrane receptor (rhodopsin family) 330615 7 transmembrane receptor (rhodopsin family) 330616 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 330617 Ubiquitin carboxyl-terminal hydrolase family 2 330621 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 330622 7 transmembrane receptor (rhodopsin family) 330623 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 330626 7 transmembrane receptor (rhodopsin family) 330627 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 330627 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 330634 ENV polyprotein (coat polyprotein) 330639 Nucleoside diphosphate kinase 330643 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330646 Sulfotransferase protein 330647 Ubiquitin carboxyl-terminal hydrolase family 2 330654 Protein kinase domain 330656 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 330657 Trypsin 330662 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 330666 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 330675 von Willebrand factor type D domain 330679 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330679 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330683 7 transmembrane receptor (rhodopsin family) 330683 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 330684 7 transmembrane receptor (rhodopsin family) 330684 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 330685 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330686 Ribosomal protein L23 330688 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 330688 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330689 Ribosomal protein L23 330693 Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl 330721 Protein kinase domain 330723 Trypsin 330723 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 330724 PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown 330724 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 330727 S25 ribosomal protein 330731 Zinc-binding dehydrogenase 330731 Conserved hypothetical protein 95 330731 Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains 330731 Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat 330735 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 330735 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 330743 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330743 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330743 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330749 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330753 Cadherin domain 330757 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330758 Integral membrane protein DUF106. This archaebacterial protein family has no known function. Members are predicted to be integral membrane proteins 330758 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 330759 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330759 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330760 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 330761 HMG (high mobility group) box 330765 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 330770 Disintegrin 330770 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 330770 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 330772 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 330777 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330777 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330777 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330778 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 330783 Ribosomal S3Ae family 330785 Ribosomal protein L35Ae 330786 ENV polyprotein (coat polyprotein) 330786 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 330788 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 330788 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 330790 Extracellular link domain 330790 Extracellular link domain 330803 Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal 330805 S25 ribosomal protein 330808 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330808 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330810 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 330812 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 330812 Zinc finger, C3HC4 type (RING finger) 330812 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 330813 Galactose binding lectin domain 330814 7 transmembrane receptor (Secretin family) 330814 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 330814 Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein coup 330817 Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine s 330825 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330825 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330836 Amino acid permease 330836 Tryptophan/tyrosine permease family 330836 Amino acid permease 330844 Lectin C-type domain. This family includes both long and short form C-type 330862 PMP-22/EMP/MP20/Claudin family 330870 Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl 330873 Ribosomal protein S6e 330880 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 330885 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 330885 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 330886 ENV polyprotein (coat polyprotein) 330887 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 330889 Ribosomal protein S5, N-terminal domain 330889 Ribosomal protein S5, C-terminal domain 330889 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 330890 Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown 330891 HMG (high mobility group) box 330898 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 330898 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 330899 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 330900 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 330900 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 330905 Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid 330914 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 330918 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 330926 7 transmembrane receptor (rhodopsin family) 330928 7 transmembrane receptor (rhodopsin family) 330936 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 330938 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 330948 Ribosomal protein S8e 330957 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 330960 Fibronectin type III domain 330960 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 330961 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 330975 Ribosomal protein S8 330982 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 330983 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 330984 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 330984 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 330986 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 330988 Ribosomal S3Ae family 330991 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 330998 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 330999 Ribosomal L29e protein family 331002 Ribosomal protein S5, C-terminal domain 331004 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu 331004 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 331012 Translationally controlled tumor protein 331014 HMG (high mobility group) box 331026 Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars 331027 F-box domain 331039 14-3-3 protein 331046 Transglutaminase family 331046 Transglutaminase family, C-terminal ig like domain 331046 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 331046 Transglutaminase family 331046 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 331047 SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145 331049 LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known 331050 Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai 331051 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331052 Protein kinase domain 331056 Somatotropin hormone family 331062 Adenylate and Guanylate cyclase catalytic domain 331065 L1 transposable element 331066 Helix-loop-helix DNA-binding domain 331068 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331069 Cadherin domain 331071 ENV polyprotein (coat polyprotein) 331072 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 331072 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 331074 Apoptosis regulator proteins, Bcl-2 family 331080 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331081 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331085 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 331085 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 331085 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331089 Defensin propeptide 331091 Defensin propeptide 331094 L1 transposable element 331101 Protein kinase domain 331102 14-3-3 protein 331103 7 transmembrane receptor (rhodopsin family) 331104 Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast member of this family 331104 pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem 331126 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 331133 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 331134 7 transmembrane receptor (metabotropic glutamate family) 331138 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 331142 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 331145 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 331146 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 331148 Protein kinase domain 331149 ENV polyprotein (coat polyprotein) 331149 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 331153 7 transmembrane receptor (metabotropic glutamate family) 331161 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 331163 7 transmembrane receptor (metabotropic glutamate family) 331175 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331176 7 transmembrane receptor (metabotropic glutamate family) 331177 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 331178 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 331182 ENV polyprotein (coat polyprotein) 331182 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 331189 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 331192 7 transmembrane receptor (rhodopsin family) 331198 7 transmembrane receptor (rhodopsin family) 331205 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331205 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331209 Somatotropin hormone family 331210 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331212 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331215 Somatotropin hormone family 331221 7 transmembrane receptor (metabotropic glutamate family) 331224 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 331226 Cytochrome C and Quinol oxidase polypeptide I 331236 7 transmembrane receptor (rhodopsin family) 331239 7 transmembrane receptor (rhodopsin family) 331241 7 transmembrane receptor (rhodopsin family) 331242 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 331244 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331247 5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF i 331252 7 transmembrane receptor (metabotropic glutamate family) 331256 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 331262 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 331264 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331267 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331267 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331274 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 331276 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 331285 ENV polyprotein (coat polyprotein) 331288 Protein kinase domain 331297 Protein kinase domain 331301 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331306 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331307 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331310 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331323 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 331330 Adenylate and Guanylate cyclase catalytic domain 331331 Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related 331335 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 331345 7 transmembrane receptor (metabotropic glutamate family) 331348 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331348 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331348 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 331355 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331357 Ribosomal L18ae protein family 331358 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 331360 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 331360 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 331362 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331365 L1 transposable element 331376 NUDIX domain 331378 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 331381 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 331382 Eukaryotic porin 331383 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 331389 ENV polyprotein (coat polyprotein) 331389 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331389 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 331393 Ribosomal protein L6 331394 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331398 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331399 Ribosomal protein S27 331400 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 331403 Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that 331403 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331404 Ham1 family. This family consists of the HAM1 protein and hypothetical archaeal bacterial and C. elegans proteins. HAM1 controls 6-N-hydroxylaminopurine (HAP) sensitivity and mutagenesis in S. cerevisiae. The HAM1 protein protects the cell from HAP, eithe 331409 7 transmembrane receptor (rhodopsin family) 331410 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 331411 ENV polyprotein (coat polyprotein) 331413 ATP synthase 331414 Pou domain - N-terminal to homeobox domain 331415 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 331417 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 331420 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 331428 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331429 Ribosomal L10 331435 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 331439 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 331439 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 331439 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331440 Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o 331443 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 331443 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 331443 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331445 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331446 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 331448 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 331449 Papain family cysteine protease 331451 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 331454 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 331454 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 331457 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331457 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331458 ENV polyprotein (coat polyprotein) 331459 Actin 331462 Translationally controlled tumor protein 331462 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 331463 Ribosomal protein L21e 331466 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 331467 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 331468 Glutamine amidotransferase class-I 331469 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 331469 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331471 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 331474 Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved 331476 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 331480 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 331482 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 331483 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 331488 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 331489 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 331490 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran 331491 Fibronectin type III domain 331493 7 transmembrane receptor (rhodopsin family) 331494 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 331494 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 331496 Kinesin motor domain 331498 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 331498 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331498 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 331500 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331502 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331504 ENV polyprotein (coat polyprotein) 331504 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331505 HMG (high mobility group) box 331506 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 331507 Low molecular weight phosphotyrosine protein phosphatase 331510 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331512 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 331514 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331515 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 331515 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 331522 Hsp90 protein 331523 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 331526 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 331526 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 331535 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 331535 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 331536 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 331539 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 331542 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 331543 HMG (high mobility group) box 331546 Sodium:neurotransmitter symporter family 331551 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 331552 Actin 331553 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 331557 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 331559 ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity 331559 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 331559 Phosphoribosylglycinamide synthetase, B domain. Phosphoribosylglycinamide synthetase catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by Phosphoribosylglycinamide synthetase is the ATP- dependent addition of 5-phosph 331560 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331560 Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal 331560 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 331560 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331562 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331564 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 331566 Connexin 331570 Ribosomal protein S28e 331571 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331571 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331573 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 331582 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 331585 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 331587 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 331589 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 331599 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 331611 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 331614 Ribosomal protein S6e 331615 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 331616 ENV polyprotein (coat polyprotein) 331616 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 331619 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 331626 pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I 331646 7 transmembrane receptor (rhodopsin family) 331650 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331653 ENV polyprotein (coat polyprotein) 331657 Ribosomal protein S2 331665 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 331666 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 331676 Ribosomal protein L19e 331698 7 transmembrane receptor (rhodopsin family) 331703 RNA polymerase Rpb4 331705 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 331708 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 331714 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 331715 metallopeptidase family M24 331722 L1 transposable element 331724 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 331727 Ribosomal protein L6 331740 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331740 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331746 Protein kinase domain 331750 Tropomyosin 331750 Myosin head (motor domain) 331750 Intermediate filament protein 331750 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 331750 Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown 331750 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 331752 Anaphase-promoting complex, subunit 10 (APC10) 331755 7 transmembrane receptor (rhodopsin family) 331756 7 transmembrane receptor (rhodopsin family) 331757 7 transmembrane receptor (rhodopsin family) 331758 7 transmembrane receptor (rhodopsin family) 331762 Ribosomal L15 331807 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 331816 Ribosomal protein L11, RNA binding domain 331824 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331826 Death effector domain 331830 Ribosomal L29e protein family 331844 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 331852 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 331857 Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formatio 331860 Cyclophilin type peptidyl-prolyl cis-trans isomerase 331861 Ribosomal protein L3 331868 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331871 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331873 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331875 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331875 Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv 331876 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331878 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331879 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331880 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331881 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331882 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331883 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331884 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331885 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 331887 Eukaryotic initiation factor 1A 331888 SKIP/SNW domain. This domain is found in chromatin proteins 331890 7 transmembrane receptor (rhodopsin family) 331891 Protein kinase domain 331893 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331893 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331897 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 331900 Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER 331901 Core histone H2A/H2B/H3/H4 331904 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 331909 Core histone H2A/H2B/H3/H4 331909 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 331910 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331910 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331921 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 331921 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 331923 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 331924 Cyclophilin type peptidyl-prolyl cis-trans isomerase 331926 Transglutaminase family, C-terminal ig like domain 331934 Uncharacterized protein family UPF0024 331935 Ribosomal protein S6e 331950 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 331950 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 331955 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 331955 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 331984 Protein kinase domain 331985 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 331986 Ribosomal L29e protein family 331987 Kinesin motor domain 331994 Choline/Carnitine o-acyltransferase 331999 GTP cyclohydrolase I. This family includes GTP cyclohydrolase enzymes and a family of related bacterial proteins 332004 Protein kinase domain 332005 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 332011 Translation initiation factor SUI1 332012 SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop 332018 Phosphatidylethanolamine-binding protein 332024 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 332052 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 332060 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 332061 HMG (high mobility group) box 332062 Ribosomal L22e protein family 332072 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 332073 PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi 332076 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 332077 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 332078 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 332082 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 332091 ENV polyprotein (coat polyprotein) 332091 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 332094 Ribosomal L15 332095 Fibrillarin 332098 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 332100 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 332100 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 332102 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 332102 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 332109 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 332110 Protein kinase domain 332112 Thymosin beta-4 family 332116 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 332118 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 332125 Ribosomal L29e protein family 332131 Tropomyosin 332131 Uncharacterized ACR, COG1579 332131 Intermediate filament protein 332131 Not1 N-terminal domain, CCR4-Not complex component 332131 ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit 332131 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 332131 Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers 332131 ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a 332131 IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation 332131 Outer membrane efflux protein. The OEP family (Outer membrane efflux protein) form trimeric channels that allow export of a variety of substrates in Gram negative bacteria. Each member of this family is composed of two repeats. The trimeric channel is com 332131 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 332131 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 332131 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 332143 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 332147 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 332148 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 332158 E1-E2 ATPase 332158 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 332160 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 332163 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 332175 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 332189 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 332189 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 332198 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 332203 Ribosomal protein S5, C-terminal domain 332215 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 332217 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332221 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 332222 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 332224 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 332229 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 332233 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 332235 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332242 HMG14 and HMG17 332244 KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG 332245 ADP-ribosylation factor family 332245 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 332256 Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene 332257 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 332257 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 332261 Ribosomal protein L31e 332261 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 332268 Ribosomal protein L36e 332273 UDP-glucoronosyl and UDP-glucosyl transferase 332274 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 332275 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 332279 Ribosomal protein S5, C-terminal domain 332279 Ribosomal protein S5, N-terminal domain 332280 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 332281 WD domain, G-beta repeat 332283 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 332288 Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity 332288 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 332297 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 332304 L1 transposable element 332304 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 332310 Ribosomal protein S5, C-terminal domain 332317 ENV polyprotein (coat polyprotein) 332317 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 332333 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 332333 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 332342 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 332359 CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di 332359 DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which 332361 Domain of Unknown Function (DUF408) 332365 7 transmembrane receptor (rhodopsin family) 332378 Cyclophilin type peptidyl-prolyl cis-trans isomerase 332396 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 332402 ENV polyprotein (coat polyprotein) 332404 Ribosomal protein S5, C-terminal domain 332404 Ribosomal protein S5, N-terminal domain 332427 Transglycosylase SLT domain. This family is distantly related to pfam00062 332431 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 332450 7 transmembrane receptor (Secretin family) 332450 Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain 332462 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 332474 Trypsin 332474 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 332482 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 332482 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 332488 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 332494 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 332499 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 332511 dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim 332512 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 332537 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 332537 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 332548 Ribosomal protein L31e 332555 Ribosomal L29e protein family 332560 Protein kinase domain 332568 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 332570 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 332588 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 332588 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 332590 Ribosomal protein L36e 332596 Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenyla 332597 S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea 332604 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 332604 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 332627 Ribosomal L15 332630 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 332630 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 332631 MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains 332631 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 332645 Ribosomal protein L21e 332652 HMG (high mobility group) box 332664 Nucleoside diphosphate kinase 332671 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 332672 HMG (high mobility group) box 332677 ENV polyprotein (coat polyprotein) 332677 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 332683 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 332693 ATP synthase subunit C 332708 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332709 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332714 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332716 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332718 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332719 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332720 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 332721 Ribosomal protein S24e 332722 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 332723 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 332724 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 332724 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 332726 Eukaryotic-type carbonic anhydrase 332730 ENV polyprotein (coat polyprotein) 332737 ENV polyprotein (coat polyprotein) 332737 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 332738 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt 332741 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 332742 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 332750 7 transmembrane receptor (rhodopsin family) 332764 Cyclophilin type peptidyl-prolyl cis-trans isomerase 332770 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 332775 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 332784 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 332786 Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins 332787 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 332790 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 332790 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 332793 O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production 332800 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 332801 Ribosomal protein L6e 332822 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 332844 Protein kinase domain 332844 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 332860 ENV polyprotein (coat polyprotein) 332860 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 332860 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 332860 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 332862 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 332862 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 332866 S25 ribosomal protein 332876 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 332878 ENV polyprotein (coat polyprotein) 332880 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 332880 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 332880 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 332881 L1 transposable element 332897 L1 transposable element 332904 CTF/NF-I family 332934 MYND finger 332937 Transcription factor AP-2 332939 Ribosomal protein S19 332958 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 332971 Amiloride-sensitive sodium channel 332974 Protein kinase domain 332974 ENV polyprotein (coat polyprotein) 332979 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 332987 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 332993 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 332999 Fibronectin type III domain 332999 Protein-tyrosine phosphatase 333010 Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene 333018 ENV polyprotein (coat polyprotein) 333022 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 333027 Ribosomal L29e protein family 333046 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 333046 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 333067 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 333067 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 333071 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 333074 'chromo' (CHRromatin Organization MOdifier) domain 333074 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 333105 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 333113 Translation initiation factor SUI1 333114 Ribosomal protein L14p/L23e 333116 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333117 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333118 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333119 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333120 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333130 Protein kinase domain 333131 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 333146 Ribosomal L29e protein family 333147 Core histone H2A/H2B/H3/H4 333153 Ribosomal protein L21e 333154 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 333157 Triosephosphate isomerase 333160 Lectin C-type domain. This family includes both long and short form C-type 333162 ENV polyprotein (coat polyprotein) 333162 Poly-adenylate binding protein, unique domain 333163 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 333163 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 333166 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 333177 Ribosomal L18ae protein family 333180 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 333182 Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase 333185 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 333187 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 333194 Ribosomal protein L23 333200 7 transmembrane receptor (metabotropic glutamate family) 333201 7 transmembrane receptor (metabotropic glutamate family) 333201 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 333202 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 333202 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 333205 7 transmembrane receptor (rhodopsin family) 333206 7 transmembrane receptor (rhodopsin family) 333227 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 333229 ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum m 333238 Ribosomal protein L5 333238 ribosomal L5P family C-terminus. This region is found associated with pfam00281 333253 Protein kinase domain 333253 Sulfotransferase protein 333254 Ribosomal protein L23 333254 Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated 333254 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 333256 7 transmembrane receptor (metabotropic glutamate family) 333256 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 333258 Zinc finger, C3HC4 type (RING finger) 333258 ATP-dependent protease La (LON) domain 333260 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 333261 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 333270 Myosin head (motor domain) 333280 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 333288 Ribosomal protein S5, C-terminal domain 333291 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 333291 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 333296 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 333307 Zinc finger, C3HC4 type (RING finger) 333307 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 333309 Translation initiation factor SUI1 333314 OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine 333315 Calx-beta domain 333320 Intermediate filament protein 333329 Cyclic nucleotide-binding domain 333329 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 333331 Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold 333331 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 333343 Zinc-binding dehydrogenase 333351 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 333355 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 333356 Lectin C-type domain. This family includes both long and short form C-type 333359 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 333366 Cyclin-dependent kinase regulatory subunit 333368 7 transmembrane receptor (rhodopsin family) 333369 7 transmembrane receptor (rhodopsin family) 333380 7 transmembrane receptor (rhodopsin family) 333384 Fork head domain 333395 Ribosomal protein S5, C-terminal domain 333401 Ribosomal protein S27 333412 Ribosomal protein S8 333412 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 333426 ENV polyprotein (coat polyprotein) 333433 NAD-dependent glycerol-3-phosphate dehydrogenase 333439 Ribosomal protein L19e 333443 Ribosomal protein L23 333443 Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a 333444 7 transmembrane receptor (rhodopsin family) 333446 7 transmembrane receptor (rhodopsin family) 333447 Protein kinase domain 333452 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 333455 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 333456 Homeobox domain 333460 Protein kinase domain 333461 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333462 SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. The 333466 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333469 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333473 Zinc finger C-x8-C-x5-C-x3-H type (and similar) 333474 7 transmembrane receptor (metabotropic glutamate family) 333475 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 333477 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 333483 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333488 Ribosomal L22e protein family 333491 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 333492 Eukaryotic porin 333492 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 333497 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 333502 Repeat in ubiquitin-activating (UBA) protein 333506 Ribosomal L29e protein family 333506 ENV polyprotein (coat polyprotein) 333511 R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA 333511 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 333515 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 333516 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 333522 ENV polyprotein (coat polyprotein) 333522 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 333533 ENV polyprotein (coat polyprotein) 333534 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 333536 ENV polyprotein (coat polyprotein) 333539 Ribosomal protein L6 333543 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 333543 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 333550 HMG (high mobility group) box 333555 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 333556 Ribosomal protein L13e 333556 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 333557 Ribosomal protein S5, C-terminal domain 333560 Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran 333561 Phosphoglycerate kinase 333563 Fibronectin type III domain 333564 Fibronectin type III domain 333566 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 333567 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 333567 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 333569 ENV polyprotein (coat polyprotein) 333572 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 333573 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 333577 Fructose-bisphosphate aldolase class-I 333582 Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen 333586 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 333586 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 333595 Ribosomal protein L10 333596 Ribosomal protein L6e 333599 Ribosomal protein L23 333608 ENV polyprotein (coat polyprotein) 333614 Death domain 333620 L1 transposable element 333621 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 333626 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 333628 Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor 333629 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 333633 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 333641 L1 transposable element 333649 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 333654 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 333654 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 333660 Translationally controlled tumor protein 333664 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 333664 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 333665 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 333669 Sodium:solute symporter family 333672 7 transmembrane receptor (metabotropic glutamate family) 333676 L1 transposable element 333678 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 333679 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 333680 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 333687 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333688 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 333688 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333689 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333691 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333692 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333694 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333695 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333697 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333698 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333699 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333708 WD domain, G-beta repeat 333710 Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for 333711 Class I Histocompatibility antigen, domains alpha 1 and 2 333712 Class I Histocompatibility antigen, domains alpha 1 and 2 333714 Class I Histocompatibility antigen, domains alpha 1 and 2 333715 Class I Histocompatibility antigen, domains alpha 1 and 2 333715 Class I Histocompatibility antigen, domains alpha 1 and 2 333715 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333716 Protein kinase domain 333718 Ribosomal protein L6 333719 Homeobox domain 333727 ENV polyprotein (coat polyprotein) 333727 Class I Histocompatibility antigen, domains alpha 1 and 2 333728 Class I Histocompatibility antigen, domains alpha 1 and 2 333730 B-box zinc finger 333732 Class I Histocompatibility antigen, domains alpha 1 and 2 333735 Class II histocompatibility antigen, beta domain 333737 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 333738 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 333739 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 333741 Class I Histocompatibility antigen, domains alpha 1 and 2 333742 Class I Histocompatibility antigen, domains alpha 1 and 2 333742 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333743 Clathrin adaptor complex small chain 333743 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits 333745 Class I Histocompatibility antigen, domains alpha 1 and 2 333746 Class I Histocompatibility antigen, domains alpha 1 and 2 333746 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333747 Class I Histocompatibility antigen, domains alpha 1 and 2 333749 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 333749 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 333750 Class I Histocompatibility antigen, domains alpha 1 and 2 333752 L1 transposable element 333753 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 333755 L1 transposable element 333756 MCM2/3/5 family 333758 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 333759 pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn 333760 von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA 333761 7 transmembrane receptor (rhodopsin family) 333763 Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway 333763 Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran 333764 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 333767 Doublecortin 333769 e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit 333769 Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins 333773 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 333773 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 333775 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 333775 gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro 333778 von Willebrand factor type D domain 333778 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 333778 TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pro 333783 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 333784 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 333788 ADP-ribosylation factor family 333788 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 333791 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333797 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 333806 KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels 333812 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 333813 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 333818 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 333818 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 333822 Ribosomal protein L3 333827 PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi 333827 TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s 333830 Ribosomal protein S7e 333831 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 333831 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 333834 jmjN domain 333834 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 333836 Sodium:solute symporter family 333842 Ribosomal protein S5, C-terminal domain 333842 Ribosomal protein S5, N-terminal domain 333845 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 333846 HIT family 333846 Ribosomal protein S6e 333849 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333850 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333851 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333854 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333855 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333856 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333858 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333859 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333866 Core histone H2A/H2B/H3/H4 333868 ENV polyprotein (coat polyprotein) 333869 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 333872 Zinc finger, C3HC4 type (RING finger) 333873 L1 transposable element 333873 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 333873 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 333875 Class I Histocompatibility antigen, domains alpha 1 and 2 333877 L1 transposable element 333879 7 transmembrane receptor (rhodopsin family) 333883 u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087 333886 Adenosine-deaminase (editase) domain 333886 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 333888 ENV polyprotein (coat polyprotein) 333889 dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate 333896 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333897 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333898 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333899 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333903 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 333906 Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain 333907 Mitochondrial carrier protein 333908 Ribosomal protein L21e 333910 ENV polyprotein (coat polyprotein) 333913 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 333915 L1 transposable element 333916 L1 transposable element 333924 Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli 337867 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t 337868 Helix-loop-helix DNA-binding domain 337868 Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates 337876 Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys 337924 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 338322 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 338323 PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain 338324 S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins 338337 Sec34-like family. Sec34 and Sec35 form a sub-complex, in a seven protein complex that includes Dor1 (PFAM:PF04124). This complex is thought to be important for tether vesicles to the Golgi 338345 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 338346 7 transmembrane receptor (rhodopsin family) 338355 Uncharacterized ACR, COG1579 338355 FKBP-type peptidyl-prolyl cis-trans isomerase 338355 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 338355 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 338364 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 338365 Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha 338370 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 338375 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 338376 Interferon alpha/beta domain 338401 LIM domain. This family represents two copies of the LIM structural domain 338442 7 transmembrane receptor (rhodopsin family) 338442 7 transmembrane receptor (rhodopsin family) 338443 7 transmembrane receptor (rhodopsin family) 338521 Fatty acid hydroxylase 338521 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc 338521 Fatty acid hydroxylase 338521 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 338523 jmjC domain 338523 Lysosome-associated membrane glycoprotein (Lamp) 338540 Ribosomal protein L13 338557 7 transmembrane receptor (rhodopsin family) 338557 7 transmembrane receptor (rhodopsin family) 338559 Cyclophilin type peptidyl-prolyl cis-trans isomerase 338561 Ribosomal protein L23 338563 Protein kinase domain 338567 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 338569 Repair protein Rad1/Rec1/Rad17 338570 NAC domain 338575 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 338590 PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about 338596 Flavivirus glycoprotein, central and dimerisation domains 338596 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 338599 Cyclophilin type peptidyl-prolyl cis-trans isomerase 338599 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 338601 Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold 338605 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 338606 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 338611 Ribosomal protein S26e 338614 Helix-loop-helix DNA-binding domain 338619 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 338623 Fork head domain 338626 Mitochondrial carrier protein 338628 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 338629 Fibronectin type III domain 338637 7 transmembrane receptor (rhodopsin family) 338639 7 transmembrane receptor (rhodopsin family) 338642 Olfactomedin-like domain 338642 7 transmembrane receptor (rhodopsin family) 338644 Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly 338646 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 338649 Ribosomal protein L11, RNA binding domain 338649 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 338652 Actin 338656 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 338660 Homeobox domain 338662 7 transmembrane receptor (rhodopsin family) 338674 7 transmembrane receptor (rhodopsin family) 338675 7 transmembrane receptor (rhodopsin family) 338676 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 338682 C2 domain 338692 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 338699 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 338718 7 transmembrane receptor (rhodopsin family) 338733 Oxysterol-binding protein 338736 Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin 338736 Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis 338736 Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases 338742 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 338746 Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG 338747 7 transmembrane receptor (rhodopsin family) 338751 7 transmembrane receptor (rhodopsin family) 338755 7 transmembrane receptor (rhodopsin family) 338757 Fibronectin type III domain 338761 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 338766 Ribosomal protein S26e 338783 Intermediate filament protein 338785 Intermediate filament protein 338785 Intermediate filament protein 338801 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 338806 ADP-ribosylation factor family 338813 Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family 338815 Lectin C-type domain. This family includes both long and short form C-type 338819 metallopeptidase family M24 338821 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 338829 Antitermination protein 338829 Bacteriophage lysis protein. This protein is involved in host lysis. This family is not considered to be a peptidase according to the MEROPs database 338830 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 338831 NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins 338832 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 338834 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 338834 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 338835 Protein kinase domain 338865 Translocon-associated protein (TRAP), alpha subunit. The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across 338869 Ribosomal protein S2 338870 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 338871 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 338872 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 338872 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 338872 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 338878 7 transmembrane receptor (rhodopsin family) 338879 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 338880 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 338902 TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway 338905 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 338912 PH domain. PH stands for pleckstrin homology 338913 Disintegrin 338913 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 338915 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 338919 Ribosomal protein S26e 338922 Ribosomal protein L21e 338952 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 338952 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 338954 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 338958 Eukaryotic ribosomal protein L18 338972 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 338972 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 338973 Cyclic nucleotide-binding domain 338973 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 338978 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi 339000 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 339000 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi 339010 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 339011 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 339026 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 339027 Ribosomal protein L31e 339039 WSC domain. This domain may be involved in carbohydrate binding 339039 Lectin C-type domain. This family includes both long and short form C-type 339039 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 339039 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 339039 REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six co 339044 PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold 339070 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are repr 339086 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t 339087 Eukaryotic-type carbonic anhydrase 339088 BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems 339094 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 339097 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 339102 Trypsin 339105 Trypsin 339115 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 339120 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 339122 ADP-ribosylation factor family 339122 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 339123 jmjC domain 339133 Disintegrin 339133 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 339133 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 339162 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 339173 Protein kinase domain 339175 ubiE/COQ5 methyltransferase family 339208 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 339212 Protein kinase domain 339221 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle 339221 Sulfatase 339221 Metalloenzyme superfamily. This family includes phosphopentomutase and 2,3-bisphosphoglycerate-independent phosphoglycerate mutase. This family is also related to pfam00245. The alignment contains the most conserved residues that are probably involved in 339221 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 339231 ADP-ribosylation factor family 339241 Intermediate filament protein 339242 Intermediate filament protein 339243 Intermediate filament protein 339244 Intermediate filament protein 339246 MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a 339261 Ribosomal protein L21e 339272 Ribosomal L10 339284 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 339295 RNA dependent RNA polymerase. This family may represent an RNA dependent RNA polymerase. The family also contains the following proteins: 2A protein from bromoviruses putative RNA dependent RNA polymerase from tobamoviruses Non structural polyprotein from 339295 Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons, 339309 Ribosomal protein S8e 339311 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 339319 von Willebrand factor type D domain 339319 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 339320 von Willebrand factor type D domain 339320 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 339321 Ets-domain 339327 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 339347 Fibronectin type II domain 339354 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 339365 Ribosomal protein S19 339371 Ribosomal protein L24e 339371 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 339377 Lectin C-type domain. This family includes both long and short form C-type 339389 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 339390 Lectin C-type domain. This family includes both long and short form C-type 339395 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 339396 Intermediate filament protein 339403 7 transmembrane receptor (rhodopsin family) 339403 7 transmembrane receptor (rhodopsin family) 339408 Cytochrome C and Quinol oxidase polypeptide I 339408 Cytochrome C oxidase subunit II, periplasmic domain 339409 Eukaryotic protein of unknown function, DUF279 339414 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 339416 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 339425 Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6 339451 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 339452 Laminin G domain 339452 Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6 339452 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 339452 Agrin NtA domain. Agrin is a multidomain heparan sulphate proteoglycan, that is a key organizer for the induction of postsynaptic specializations at the neuromuscular junction. Binding of agrin to basement membranes requires the amino terminal (NtA) domai 339452 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 339463 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 339479 Giardia variant-specific surface protein 339479 MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl 339487 Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues 339487 Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues 339488 Transcription factor AP-2 339488 Transcription factor AP-2 339489 Protein kinase domain 339492 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 339501 Trypsin 339509 Repair protein Rad1/Rec1/Rad17 339511 7 transmembrane receptor (rhodopsin family) 339521 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 339530 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 339536 Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but func 339542 Ribosomal protein S17 339544 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 339559 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 339562 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 339565 7 transmembrane receptor (rhodopsin family) 339575 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 339581 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 339587 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 339589 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 339604 Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil 339605 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 339608 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 339618 Lipase 339628 WD domain, G-beta repeat 339635 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 339639 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 339656 Sodium:solute symporter family 339661 Gamma-glutamyltranspeptidase 339672 Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However 339672 S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea 339692 Gaa1-like, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to newly s 339709 Ribosomal protein L24e 339710 F-actin capping protein, beta subunit 339714 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 339715 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 339716 Ribosomal protein S5, C-terminal domain 339716 Ribosomal protein S5, N-terminal domain 339718 Ribosomal protein S2 339721 Spumavirus gag protein 339721 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 339721 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 339736 Intermediate filament protein 339739 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 339744 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 339745 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 339749 RanBP1 domain 339749 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 339752 Homeobox domain 339755 Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues 339756 Ribosomal protein L21e 339761 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 339763 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 339772 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 339776 Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formatio 339780 Ribosomal protein L21e 339781 Intermediate filament protein 339798 Actin 339799 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 339805 Ribosomal protein L31e 339806 Ribosomal protein S2 339814 Intermediate filament protein 339821 Sulfotransferase protein 339828 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 339829 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 339839 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 339843 C2 domain 339843 HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus 339843 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 339844 LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo 339845 Arginosuccinate synthase. This family contains a PP-loop motif 339855 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 339857 Cytochrome C and Quinol oxidase polypeptide I 339858 Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hig 339869 Ribosomal protein L21e 339871 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 339879 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 339881 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 339890 Ribosomal protein L21e 339892 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 339896 Aminotransferase class-V 339896 Pyridoxal-dependent decarboxylase conserved domain 339900 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 339903 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 339905 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 339906 Trypsin 339909 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 339909 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 339939 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 339943 Protein kinase domain 339948 Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids 339951 Cadherin domain 339959 Arginosuccinate synthase. This family contains a PP-loop motif 339965 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 339967 Trypsin 339968 Trypsin 339976 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 339976 Zinc finger, C3HC4 type (RING finger) 339976 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 339983 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 339983 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions 339991 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 339991 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 340014 UDP-glucoronosyl and UDP-glucosyl transferase 340024 Sodium:neurotransmitter symporter family 340026 pfam02910, succ_DH_flav_C, Fumarate reductase/succinate dehydrogenase flavoprotein C-terminal domain. This family contains fumarate reductases, succinate dehydrogenases and L-aspartate oxidases 340029 Ribosomal protein L19e 340042 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif 340052 7 transmembrane receptor (rhodopsin family) 340057 Ribosomal L39 protein 340066 HMG14 and HMG17 340068 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 340075 Sulfatase 340077 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 340081 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 340096 Core histone H2A/H2B/H3/H4 340116 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 340119 Ribosomal protein S5, C-terminal domain 340119 Ribosomal protein S5, N-terminal domain 340120 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 340132 Ribosomal protein L31e 340133 PH domain. PH stands for pleckstrin homology 340135 Ribosomal protein L6e 340147 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 340151 Ribosomal protein S2 340156 Protein kinase domain 340159 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 340167 HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is 340179 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 340183 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 340189 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 340207 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 340215 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 340216 HMG (high mobility group) box 340216 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 340217 Homeobox domain 340221 BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction 340223 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their o 340224 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 340232 Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl 340236 Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti 340237 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 340243 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their o 340245 Mitochondrial carrier protein 340248 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 340248 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 340250 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 340252 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 340260 Homeobox domain 340265 Ribosomal protein L31e 340265 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 340273 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 340273 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 340274 Arginosuccinate synthase. This family contains a PP-loop motif 340274 tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs 340284 GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. E 340289 Ets-domain 340289 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 340307 Intermediate filament protein 340311 Intermediate filament protein 340312 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 340326 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 340330 Cyclophilin type peptidyl-prolyl cis-trans isomerase 340331 Ribosomal protein L31e 340333 Class I Histocompatibility antigen, domains alpha 1 and 2 340336 Ribosomal protein L19e 340342 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 340343 Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561 340348 Tetraspanin family 340348 Tetraspanin family 340354 Doublecortin 340358 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 340359 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 340359 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 340361 Homeobox domain 340361 Pou domain - N-terminal to homeobox domain 340368 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 340371 Protein kinase domain 340376 Ribosomal protein L23 340377 Ribosomal protein L23 340379 Ribosomal protein L23 340386 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 340406 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 340413 Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions 340416 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 340416 Transcription initiation factor IIA, gamma subunit, beta-barrel domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multi 340417 Ribosomal L10 340417 Thymosin beta-4 family 340418 PCRF domain. This domain is found in peptide chain release factors 340418 Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons 340421 Eukaryotic ribosomal protein L18 340426 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 340433 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 340441 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 340443 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 340448 Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity 340449 Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III 340452 Transforming growth factor beta like domain 340458 Homeobox domain 340459 Cytochrome C and Quinol oxidase polypeptide I 340459 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 340460 Intermediate filament protein 340472 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 340480 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 340481 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 340487 Interferon alpha/beta domain 340493 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 340499 FMN-dependent dehydrogenase 340499 2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor 340499 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 340499 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 340501 Transcription factor S-II (TFIIS) 340503 7 transmembrane receptor (metabotropic glutamate family) 340509 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 340509 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 340510 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 340518 Lipoxygenase 340518 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 340523 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 340529 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 340530 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 340535 7 transmembrane receptor (rhodopsin family) 340536 Eukaryotic protein of unknown function, DUF279 340539 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 340546 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 340548 Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M 340551 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 340555 7 transmembrane receptor (rhodopsin family) 340561 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 340564 Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellul 340565 Ribosomal protein S2 340565 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 340567 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 340567 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 340567 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 340571 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 340571 Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results 340573 HMG (high mobility group) box 340574 Core histone H2A/H2B/H3/H4 340576 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 340576 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 340578 WD domain, G-beta repeat 340591 Eukaryotic-type carbonic anhydrase 340593 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 340598 Intermediate filament protein 340600 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region 340613 Arginosuccinate synthase. This family contains a PP-loop motif 340614 Ribosomal protein S24e 340623 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 340628 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 340644 Zinc finger, C3HC4 type (RING finger) 340647 C2 domain 340649 Ribosomal protein S26e 340654 ab-hydrolase associated lipase region 340654 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 340663 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 340663 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 340665 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 340671 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 340680 HMG (high mobility group) box 340697 HMG (high mobility group) box 340699 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 340706 Protein kinase domain 340706 von Willebrand factor type A domain 340706 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 340725 Ribosomal protein L19e 340729 Ribosomal protein S8 340733 Glu/Leu/Phe/Val dehydrogenase, dimerisation domain 340733 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 340735 'Cold-shock' DNA-binding domain 340739 tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits 340742 Amiloride-sensitive sodium channel 340745 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 340749 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 340752 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 340753 CUB domain 340757 CAP protein 340765 Homeobox domain 340767 Ribosomal protein S19e 340768 CUB domain 340769 CUB domain 340780 CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh 340780 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 340783 HMG14 and HMG17 340784 Homeobox domain 340786 Ribosomal protein S26e 340808 ENV polyprotein (coat polyprotein) 340810 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 340811 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 340821 Ribosomal protein L13e 340835 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 340840 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 340844 S-adenosyl-L-homocysteine hydrolase 340844 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 340846 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 340852 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 340859 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 340874 ADP-ribosylation factor family 340874 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 340888 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 340896 MAM domain. An extracellular domain found in many receptors 340897 NADH dehydrogenase 340901 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 340913 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 340914 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 340917 Ribosomal protein S2 340931 Ribosomal family S4e 340933 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 340966 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 340967 Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoform 340968 Protein kinase domain 340970 Zinc finger, C3HC4 type (RING finger) 340970 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 340973 7 transmembrane receptor (rhodopsin family) 340977 FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin 340977 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 340978 7 transmembrane receptor (rhodopsin family) 340980 7 transmembrane receptor (rhodopsin family) 340996 HMG (high mobility group) box 341002 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 341009 'chromo' (CHRromatin Organization MOdifier) domain 341009 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 341019 Doublecortin 341035 Ribosomal protein L23 341044 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 341056 ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family 341057 7 transmembrane receptor (rhodopsin family) 341058 7 transmembrane receptor (rhodopsin family) 341061 7 transmembrane receptor (rhodopsin family) 341062 7 transmembrane receptor (rhodopsin family) 341065 7 transmembrane receptor (rhodopsin family) 341094 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 341098 Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are repr 341102 7 transmembrane receptor (rhodopsin family) 341105 7 transmembrane receptor (rhodopsin family) 341112 tRNA synthetases class I (W and Y) 341113 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 341115 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 341116 CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a 341119 Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregatio 341123 Sugar (and other) transporter 341128 7 transmembrane receptor (rhodopsin family) 341130 7 transmembrane receptor (rhodopsin family) 341135 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling. 341138 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 341140 Actin 341141 7 transmembrane receptor (rhodopsin family) 341142 7 transmembrane receptor (rhodopsin family) 341145 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 341146 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch 341147 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 341149 Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for 341151 7 transmembrane receptor (rhodopsin family) 341152 7 transmembrane receptor (rhodopsin family) 341155 MAS20 protein import receptor 341168 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 341170 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 341172 HMG (high mobility group) box 341180 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 341198 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 341200 Ribosomal S3Ae family 341201 Cadherin domain 341202 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 341202 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 341208 Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain 341218 Cyclophilin type peptidyl-prolyl cis-trans isomerase 341225 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 341226 Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold 341228 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 341230 Arginosuccinate synthase. This family contains a PP-loop motif 341245 Eukaryotic ribosomal protein L18 341254 Aminotransferase class I and II 341256 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 341276 7 transmembrane receptor (rhodopsin family) 341277 Trypsin 341277 CUB domain 341278 7 transmembrane receptor (rhodopsin family) 341279 7 transmembrane receptor (rhodopsin family) 341287 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 341290 Ribosomal protein L6e 341301 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 341305 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 341306 Ribosomal L15 341308 Eukaryotic porin 341313 Cytochrome c oxidase subunit Vb 341315 Poly-adenylate binding protein, unique domain 341315 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 341316 Ribosomal protein L44 341321 Ribosomal protein L21e 341341 Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein 341341 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 341346 tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase 341350 Trypsin 341350 CUB domain 341356 Ribosomal protein L31e 341359 C2 domain 341376 Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a 341383 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 341392 AMP-binding enzyme 341399 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 341404 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 341405 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 341407 Intermediate filament protein 341408 Intermediate filament protein 341409 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 341409 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 341412 Ribosomal protein L31e 341414 Eukaryotic porin 341415 HSF-type DNA-binding 341416 7 transmembrane receptor (rhodopsin family) 341417 7 transmembrane receptor (rhodopsin family) 341418 7 transmembrane receptor (rhodopsin family) 341444 NAC domain 341448 Intermediate filament protein 341449 Intermediate filament protein 341457 Cyclophilin type peptidyl-prolyl cis-trans isomerase 341459 Chaperonin 10 Kd subunit 341465 Ribosomal protein S6e 341470 POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters 341472 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 341473 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 341479 Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding 341483 Chaperonin 10 Kd subunit 341511 Ribosomal protein L23 341512 Ribosomal protein L13 341513 Tetraspanin family 341515 Ribosomal protein S26e 341525 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 341549 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 341555 Ribosomal protein L21e 341556 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 341556 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 341556 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 341558 Homeobox domain 341560 ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain 341565 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 341568 7 transmembrane receptor (rhodopsin family) 341578 Glypican 341583 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 341592 Tub family 341592 Hsp90 protein 341599 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 341604 Ribosomal L29 protein 341616 Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc f 341638 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 341643 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 341651 Lyase 341669 Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi 341669 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 341674 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 341676 Protein kinase domain 341680 Cadherin domain 341700 Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f 341704 Ribosomal protein S2 341706 DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions 341715 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 341720 MmgE/PrpD family. This family includes 2-methylcitrate dehydratase EC:4.2.1.79 (PrpD) that is required for propionate catabolism. It catalyses the third step of the 2-methylcitric acid cycle 341722 Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea 341728 Ribosomal protein L31e 341732 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 341752 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 341753 PXA domain. This domain is associated with PX domains pfam00787 341760 Cyclophilin type peptidyl-prolyl cis-trans isomerase 341761 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 341776 Ribosomal protein L23 341784 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 341795 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 341796 7 transmembrane receptor (rhodopsin family) 341799 7 transmembrane receptor (rhodopsin family) 341803 Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices 341816 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 341831 Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family 341834 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 341835 Amidase 341835 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 341845 14-3-3 protein 341848 Ribosomal protein S8 341856 7 transmembrane receptor (rhodopsin family) 341858 GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins 341862 ATP synthase 341864 HMG14 and HMG17 341864 Ribosomal protein L21e 341864 Transcription factor TFIID (or TATA-binding protein, TBP) 341868 Ribosomal protein L13 341893 Aminotransferase class I and II 341894 Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger 341897 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 341899 HMG (high mobility group) box 341902 Disintegrin 341902 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop 341902 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 341905 PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins 341910 Ribosomal protein S5, C-terminal domain 341910 Ribosomal protein S5, N-terminal domain 341915 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2 341922 Dynein light chain type 1 341925 Ribosomal protein S26e 341927 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 341949 Ribosomal L15 341963 Ribosomal protein L3 341965 Eukaryotic porin 341973 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 341973 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 341974 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 341974 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 342013 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 342015 Ribosomal protein L6 342032 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 342035 Olfactomedin-like domain 342035 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G 342035 Olfactomedin-like domain 342035 Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G- 342037 Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription 342041 Ribosomal protein S2 342042 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 342042 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 342042 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 342052 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 342052 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 342053 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 342072 'Cold-shock' DNA-binding domain 342080 SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and 342092 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 342101 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 342103 Cyclophilin type peptidyl-prolyl cis-trans isomerase 342113 Uncharacterized ACR, COG1579 342113 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 342113 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 342116 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 342119 Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa 342138 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 342146 AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d 342155 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 342155 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 342156 Glypican 342163 Dynamin GTPase effector domain 342183 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 342206 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 342209 Glypican 342209 ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions 342209 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 342234 F-actin capping protein alpha subunit 342253 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 342261 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 342289 Lectin C-type domain. This family includes both long and short form C-type 342293 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 342315 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 342316 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi 342331 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 342341 Cytochrome C and Quinol oxidase polypeptide I 342357 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 342357 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342357 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 342358 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 342370 Ribosomal L10 342372 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 342372 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 342372 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 342372 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 342372 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 342372 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 342374 Intermediate filament protein 342405 Cyclophilin type peptidyl-prolyl cis-trans isomerase 342407 Ribosomal protein L21e 342409 Homeobox domain 342419 Intermediate filament protein 342419 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 342423 Filamin/ABP280 repeat 342424 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 342427 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 342429 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 342432 Homeobox domain 342443 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 342443 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 342444 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 342452 Ribosomal protein L23 342460 Sulfotransferase protein 342509 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 342527 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 342531 Lipoxygenase 342531 PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this 342538 NAC domain 342541 Cyclophilin type peptidyl-prolyl cis-trans isomerase 342543 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 342550 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 342565 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342573 Intermediate filament protein 342574 Intermediate filament protein 342574 Intermediate filament protein 342575 Intermediate filament protein 342575 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 342578 HMG (high mobility group) box 342594 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 342599 Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain 342614 Uncharacterised protein family (UPF0185). This family contains a number of small uncharacterised proteins including BM-002 342615 Adenylosuccinate synthetase 342626 Protein kinase domain 342636 Trypsin 342658 Flavodoxin 342660 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 342661 14-3-3 protein 342662 C2 domain 342667 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 342667 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 342684 Uncharacterized ACR, COG1354 342684 Uncharacterized ACR, COG1579 342684 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 342684 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc 342695 Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognized for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family includ 342697 metallopeptidase family M24 342705 NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold 342705 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 342705 6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each 342732 Intermediate filament protein 342741 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 342779 Ribosomal protein L6 342780 Acyltransferase family. This family includes a range of acyltransferase enzymes. This domain is found in many as yet uncharacterised C. elegans proteins and it is approximately 300 amino acids long 342784 BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function 342808 Ribosomal protein S5, C-terminal domain 342808 Ribosomal protein S5, N-terminal domain 342827 PXA domain. This domain is associated with PX domains pfam00787 342839 Ribosomal L10 342842 Sodium:solute symporter family 342851 Sodium:solute symporter family 342854 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 342883 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 342885 Glycosyltransferase family 6 342886 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 342886 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 342886 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 342888 Protein kinase domain 342892 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342894 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342897 F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated 342900 Homeobox domain 342905 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 342908 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342911 Ribosomal protein S28e 342917 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342923 Ribosomal L39 protein 342923 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342926 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342931 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 342933 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 342933 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 342934 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 342934 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 342938 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342939 Homeobox domain 342943 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 342945 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 342958 Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin) 342965 7 transmembrane receptor (rhodopsin family) 342966 7 transmembrane receptor (rhodopsin family) 342967 Actin 342969 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342970 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342972 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342982 Lectin C-type domain. This family includes both long and short form C-type 342984 DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases 342989 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 342989 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342990 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342990 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 342991 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 342994 Ribosomal protein L34e 343002 Ribosomal protein L34e 343011 Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse 343015 Ribosomal protein S19 343018 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 343019 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 343024 Sushi domain (SCR repeat) 343033 S25 ribosomal protein 343042 Ribosomal protein S28e 343045 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 343047 Intermediate filament protein 343059 Ets-domain 343066 Carboxylesterase 343069 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 343082 Amidase 343088 Sushi domain (SCR repeat) 343094 B-box zinc finger 343095 Fibronectin type III domain 343095 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 343113 Uricase 343114 Uricase 343119 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 343122 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 343128 ADP-ribosylation factor family 343128 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 343133 7 transmembrane receptor (rhodopsin family) 343141 Mitochondrial carrier protein 343142 Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding 343145 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 343146 Ribosomal protein S19 343151 Aminotransferase class I and II 343161 HMG (high mobility group) box 343162 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 343164 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 343165 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 343169 7 transmembrane receptor (rhodopsin family) 343170 7 transmembrane receptor (rhodopsin family) 343171 7 transmembrane receptor (rhodopsin family) 343172 7 transmembrane receptor (rhodopsin family) 343173 7 transmembrane receptor (rhodopsin family) 343176 Core histone H2A/H2B/H3/H4 343181 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 343184 Ribosomal protein S5, C-terminal domain 343184 Ribosomal protein S5, N-terminal domain 343186 Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers 343187 Calreticulin family 343190 CUB domain 343205 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 343230 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 343230 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 343233 Zinc-binding dehydrogenase 343259 Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds 343259 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 343263 Fibronectin type III domain 343290 ICE-like protease (caspase) p20 domain 343295 Ribosomal protein L14p/L23e 343295 SKIP/SNW domain. This domain is found in chromatin proteins 343296 Zinc-binding dehydrogenase 343297 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 343300 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 343301 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 343311 Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes 343320 Yippee putative zinc-binding protein 343324 Translation initiation factor SUI1 343326 Intermediate filament protein 343332 Polyprenyl synthetase 343337 Ribosomal protein L21e 343338 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 343338 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 343342 CUB domain 343342 Sushi domain (SCR repeat) 343353 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 343354 Actin 343363 NAC domain 343381 CobN/Magnesium Chelatase. This family contains a domain common to the cobN protein and to magnesium protoporphyrin chelatase. CobN is implicated in the conversion of precorrin-2 to cobyrinic acid. Magnesium protoporphyrin chelatase is involved in chloroph 343384 Cyclophilin type peptidyl-prolyl cis-trans isomerase 343388 Ribosomal protein L31e 343406 7 transmembrane receptor (rhodopsin family) 343407 Hsp90 protein 343408 HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758 343409 7 transmembrane receptor (rhodopsin family) 343429 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 343429 Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA 343433 Amidase 343448 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 343448 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 343448 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 343450 Calcium-activated BK potassium channel alpha subunit 343450 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 343472 Homeobox domain 343476 Ribosomal L39 protein 343477 Hsp90 protein 343479 Ribosomal protein L23 343479 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 343479 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 343489 Kinesin motor domain 343495 Ribosomal protein L6e 343498 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 343498 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 343499 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 343499 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 343500 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 343502 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 343503 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 343503 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 343508 Aconitase family (aconitate hydratase) 343508 Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism 343510 Intermediate filament protein 343515 ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit 343521 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 343523 Amidase 343529 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes 343552 Fork head domain 343563 7 transmembrane receptor (rhodopsin family) 343578 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 343585 Intermediate filament protein 343587 SCP-like extracellular protein. This domain is also found in prokaryotes 343588 Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc 343591 Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure 343593 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 343602 Ribosomal protein S2 343602 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 343638 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 343641 Transglutaminase family 343641 Transglutaminase family, C-terminal ig like domain 343641 Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine, 343651 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 343654 Cadherin domain 343667 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 343671 Cyclophilin type peptidyl-prolyl cis-trans isomerase 343683 Ribosomal protein L24e 343701 Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt 343705 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 343708 HMG (high mobility group) box 343718 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 343720 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 343729 Eukaryotic porin 343729 Ribosomal protein S26e 343744 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 343744 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 343745 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 343745 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 343749 Glutathione peroxidase 343750 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 343762 7 transmembrane receptor (rhodopsin family) 343807 Cytochrome C and Quinol oxidase polypeptide I 343822 Actin 343822 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 343824 Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation 343830 Eukaryotic-type carbonic anhydrase 343831 Protein kinase domain 343839 Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus 343851 Carboxylesterase 343854 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 343854 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 343859 Ribosomal protein S28e 343868 Calponin family repeat 343868 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 343870 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 343875 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 343876 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 343924 7 transmembrane receptor (rhodopsin family) 343930 Helix-loop-helix DNA-binding domain 343943 WD domain, G-beta repeat 343946 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 343948 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 343954 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 343957 GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue 343958 7 transmembrane receptor (rhodopsin family) 343974 Doublecortin 343981 Glutamine synthetase, beta-Grasp domain 343981 Glutamine synthetase, catalytic domain 343995 RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer 344016 BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer 344017 Mitochondrial carrier protein 344018 Helix-loop-helix DNA-binding domain 344022 Homeobox domain 344026 Ribosomal protein S28e 344056 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 344056 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 344065 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 344085 Rad52/22 family double-strand break repair protein 344108 Domain of unknown function DUF130. This family has no known function, it consists of C. elegans proteins and is present as a repeat in some members. The aligned region has 4 conserved cysteine residues and is a maximum of 175 residues long 344138 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 344152 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 344160 NADH dehydrogenase 344160 Cytochrome C and Quinol oxidase polypeptide I 344163 Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E 344167 Fork head domain 344171 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 344172 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 344177 Ribosomal protein L31e 344178 Cyclophilin type peptidyl-prolyl cis-trans isomerase 344179 Hsp90 protein 344181 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 344190 Thymosin beta-4 family 344191 Homeobox domain 344195 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 344222 Cyclophilin type peptidyl-prolyl cis-trans isomerase 344227 Actin 344228 Uncharacterised protein family (UPF0120) 344228 Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport 344231 Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included 344254 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 344262 7 transmembrane receptor (rhodopsin family) 344263 Ribosomal S3Ae family 344284 Homeobox domain 344286 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 344287 Putative zinc finger in N-recognin 344292 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 344299 Cyclophilin type peptidyl-prolyl cis-trans isomerase 344318 Intermediate filament protein 344320 Intermediate filament protein 344320 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 344328 TPR Domain 344357 Cyclophilin type peptidyl-prolyl cis-trans isomerase 344371 Mitochondrial carrier protein 344382 WD domain, G-beta repeat 344387 Protein kinase domain 344394 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 344401 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes. 344403 HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes. 344405 YbaK / prolyl-tRNA synthetases associated domain. This domain of unknown function is found in numerous prokaryote organisms. The structure of YbaK shows a novel fold. This domain also occurs in a number of prolyl-tRNA synthetases (proRS) from prokaryotes. 344423 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 344424 Ribosomal protein S24e 344424 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 344427 Ribosomal protein L24e 344431 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 344454 CO dehydrogenase flavoprotein C-terminal domain 344454 FAD binding domain in molybdopterin dehydrogenase 344454 Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain 344454 Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain 344458 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 344462 Intermediate filament protein 344462 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 344471 Ribosomal protein L11, RNA binding domain 344471 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 344473 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 344474 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 344494 E1-E2 ATPase 344498 IQ calmodulin-binding motif. Calmodulin-binding motif 344510 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 344516 Ubiquitin carboxyl-terminal hydrolase family 2 344543 Ribosomal L22e protein family 344557 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 344558 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 344559 Ribosomal L22e protein family 344561 7 transmembrane receptor (rhodopsin family) 344565 SH2 domain 344568 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 344570 Transferrin 344572 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 344572 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 344574 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 344593 SH2 domain 344593 Protein-tyrosine phosphatase 344596 Citrate synthase 344620 WD domain, G-beta repeat 344622 Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low 344653 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 344658 SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as 344666 Ribosomal Proteins L2, C-terminal domain 344673 Cytochrome c oxidase subunit III 344673 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 344682 Elongation factor 1 gamma, conserved domain 344682 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 344682 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 344690 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 344694 HMG (high mobility group) box 344702 Homeobox domain 344707 PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) 344715 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 344716 7 transmembrane receptor (rhodopsin family) 344717 7 transmembrane receptor (rhodopsin family) 344718 7 transmembrane receptor (rhodopsin family) 344719 Ribosomal protein L23 344726 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 344728 7 transmembrane receptor (rhodopsin family) 344737 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 344741 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 344744 FKBP-type peptidyl-prolyl cis-trans isomerase 344752 Carboxylesterase 344758 7 transmembrane receptor (rhodopsin family) 344758 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 344758 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 344760 7 transmembrane receptor (metabotropic glutamate family) 344760 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 344762 Intermediate filament protein 344765 Ribosomal protein L35Ae 344767 7 transmembrane receptor (rhodopsin family) 344775 Aminotransferase class-V 344787 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 344788 Ribosomal protein S2 344797 Cyclophilin type peptidyl-prolyl cis-trans isomerase 344805 Trypsin 344805 CUB domain 344811 'Cold-shock' DNA-binding domain 344813 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 344813 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 344819 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 344819 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 344820 SH2 domain 344821 Cyclophilin type peptidyl-prolyl cis-trans isomerase 344829 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 344838 Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha 344866 Intermediate filament protein 344875 von Willebrand factor type A domain 344880 Eukaryotic initiation factor 4E 344890 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 344905 E1-E2 ATPase 344921 GTP1/OBG family 344936 Cadherin domain 344946 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 344951 7 transmembrane receptor (rhodopsin family) 344953 Cyclin-dependent kinase 5 activator protein 344967 Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-Co 344973 Sulfatase 344978 Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri 344988 ADP-ribosylation factor family 344998 Cyclophilin type peptidyl-prolyl cis-trans isomerase 345001 3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero 345012 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 345019 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 345019 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 345019 Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts 345028 Hsp90 protein 345028 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 345035 PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family 345041 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 345051 Reeler domain 345062 Trypsin 345076 Chaperonin 10 Kd subunit 345079 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 345083 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 345095 Ribosomal protein S12 345097 Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29 345100 Trypsin 345100 C2 domain 345101 Trypsin 345101 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 345121 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 345131 Helix-loop-helix DNA-binding domain 345132 Ribosomal L22e protein family 345136 Mitochondrial carrier protein 345157 Hsp90 protein 345157 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 345174 Ribosomal protein L21e 345192 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 345203 7 transmembrane receptor (rhodopsin family) 345212 7 transmembrane receptor (rhodopsin family) 345217 Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown 345217 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 345217 V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo 345248 Hsp90 protein 345249 Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids 345255 Cytochrome c oxidase subunit Vb 345258 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 345261 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 345262 Ribosomal protein S6e 345273 Zinc-binding dehydrogenase 345274 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa 345275 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 345275 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 345288 Serum albumin family 345302 Ribosomal protein S5, C-terminal domain 345302 Ribosomal protein S5, N-terminal domain 345305 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 345310 ADP-ribosylation factor family 345310 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 345315 GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. E 345319 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 345348 Skp1 family, dimerisation domain 345348 Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play 345355 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes 345358 Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction 345363 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 345369 Family 4 glycosyl hydrolase 345372 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 345375 Herpesvirus transcription activation factor (transactivator). This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses 345378 UDP-glucoronosyl and UDP-glucosyl transferase 345379 UDP-glucoronosyl and UDP-glucosyl transferase 345381 UDP-glucoronosyl and UDP-glucosyl transferase 345381 Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai 345392 Ribosomal protein L6e 345396 Intermediate filament protein 345412 Chaperonin 10 Kd subunit 345420 Xanthomonas avirulence protein, Avr/PthA 345430 Intermediate filament protein 345443 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 345456 Profilin 345462 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 345463 'Cold-shock' DNA-binding domain 345466 6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. T 345487 Ribosomal protein L21e 345505 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 345507 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 345513 Intermediate filament protein 345528 Kinesin motor domain 345537 TRAF-type zinc finger 345537 MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans 345540 Arginosuccinate synthase. This family contains a PP-loop motif 345547 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 345557 Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit 345561 Cyclophilin type peptidyl-prolyl cis-trans isomerase 345571 RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways 345571 Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o 345571 FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi 345573 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 345587 Vinculin family 345589 Ribosomal S17 345621 Cyclophilin type peptidyl-prolyl cis-trans isomerase 345630 Fibrillarin 345637 Arginosuccinate synthase. This family contains a PP-loop motif 345637 tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs 345645 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 345651 Actin 345654 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 345659 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 345663 Actin 345678 'Cold-shock' DNA-binding domain 345687 Ribosomal L39 protein 345688 Ribosomal protein L15 345711 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 345716 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 345720 Actin 345751 Intermediate filament protein 345754 ATP synthase A chain 345754 Cytochrome c oxidase subunit III 345754 Cytochrome C and Quinol oxidase polypeptide I 345754 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 345754 Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 345754 Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 345761 PH domain. PH stands for pleckstrin homology 345768 Actin 345774 Ribosomal S3Ae family 345774 HMG (high mobility group) box 345775 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 345779 Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest 345781 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 345783 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 345783 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 345799 Ribosomal L10 345799 Ribosomal S3Ae family 345818 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 345820 Cullin family 345829 NAC domain 345841 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 345850 Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and proteins 345864 'Cold-shock' DNA-binding domain 345874 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 345882 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 345884 Intermediate filament protein 345884 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 345885 metallopeptidase family M24 345893 'chromo' (CHRromatin Organization MOdifier) domain 345893 Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain 345922 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 345922 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 345923 HMG (high mobility group) box 345925 NAC domain 345930 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 345947 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 345982 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 345988 pfam02887, PK_C, Pyruvate kinase, alpha/beta domain 345988 Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains 345993 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 345994 Isocitrate/isopropylmalate dehydrogenase 346004 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 346005 EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains. 346010 Intermediate filament protein 346012 Thymosin beta-4 family 346024 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 346024 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 346033 HMG (high mobility group) box 346036 Intermediate filament protein 346036 Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex 346036 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 346037 Translationally controlled tumor protein 346042 Connexin 346072 Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta 346085 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 346085 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 346096 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 346113 P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function 346123 Ribosomal protein L21e 346129 Intermediate filament protein 346130 Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines 346149 Core histone H2A/H2B/H3/H4 346149 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 346162 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 346167 7 transmembrane receptor (rhodopsin family) 346168 7 transmembrane receptor (rhodopsin family) 346169 7 transmembrane receptor (rhodopsin family) 346170 7 transmembrane receptor (rhodopsin family) 346171 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 346171 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 346173 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 346177 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 346189 Cytosol aminopeptidase family, N-terminal domain 346189 Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase 346190 ADP-ribosylation factor family 346190 7 transmembrane receptor (rhodopsin family) 346190 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 346191 Ribosomal protein L11, RNA binding domain 346191 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 346207 Ribosomal L29e protein family 346217 Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals 346225 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 346230 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 346232 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 346233 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 346234 Ribosomal protein L31e 346239 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 346248 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 346248 Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold 346249 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 346250 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 346250 Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold 346252 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 346284 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 346286 ADP-ribosylation factor family 346286 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 346288 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 346294 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 346295 14-3-3 protein 346317 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 346319 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 346321 Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do 346322 Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown 346322 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 346323 Ribosomal protein L24e 346325 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 346327 Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown 346327 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 346329 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 346355 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 346359 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 346362 7 transmembrane receptor (rhodopsin family) 346363 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 346379 Uncharacterized protein family UPF0007 346402 Ribosomal protein L11, RNA binding domain 346404 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 346412 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 346416 HMG (high mobility group) box 346423 Cyclophilin type peptidyl-prolyl cis-trans isomerase 346446 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 346449 7 transmembrane receptor (rhodopsin family) 346453 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 346454 Protein kinase domain 346472 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 346476 ADP-ribosylation factor family 346476 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 346517 7 transmembrane receptor (rhodopsin family) 346521 HIT family 346522 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 346522 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 346524 7 transmembrane receptor (rhodopsin family) 346525 7 transmembrane receptor (rhodopsin family) 346528 7 transmembrane receptor (rhodopsin family) 346542 Protein kinase domain 346546 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 346559 Ribosomal L10 346562 ADP-ribosylation factor family 346562 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 346564 Eukaryotic porin 346565 Cadherin domain 346565 Eukaryotic initiation factor 4E 346566 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 346566 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 346600 Clathrin adaptor complex small chain 346606 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 346606 Ribosomal protein S2 346606 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 346643 Ribosomal protein S5, C-terminal domain 346643 Ribosomal protein S5, N-terminal domain 346644 Cyclophilin type peptidyl-prolyl cis-trans isomerase 346668 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 346671 IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain 346674 Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict 346682 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 346702 Trypsin 346706 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 346707 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 346707 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 346708 7 transmembrane receptor (rhodopsin family) 346711 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 346722 C2 domain 346723 ADP-ribosylation factor family 346730 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 346741 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron 346762 Protein kinase domain 346762 Ribosomal protein L21e 346802 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 346848 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 346853 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 346856 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 346867 G10 protein 346884 Ribosomal protein S6e 346884 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 346887 Sugar (and other) transporter 346897 Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue 346904 CUB domain 346905 CUB domain 346905 Sushi domain (SCR repeat) 346931 Adenylate kinase 346931 Ribosomal protein S26e 346950 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 346971 tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included 346986 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 346988 BAG domain. Domain present in Hsp70 regulators 346989 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 346990 Core histone H2A/H2B/H3/H4 346991 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 346998 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 347013 Ribosomal protein L14p/L23e 347028 Peptidase family M41 347028 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 347031 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 347038 TMS membrane protein/tumour differentially expressed protein (TDE) 347046 HMG (high mobility group) box 347048 Inositol monophosphatase family 347048 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 347051 Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa 347055 Tropomyosin 347074 Ribosomal protein L34e 347085 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347088 7 transmembrane receptor (Secretin family) 347088 7 transmembrane receptor (Secretin family) 347088 Pentaxin family. Pentaxins are also known as pentraxins 347088 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 347099 Adenylate kinase 347115 Fork head domain 347122 Cytochrome C and Quinol oxidase polypeptide I 347165 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 347166 Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN 347168 7 transmembrane receptor (rhodopsin family) 347169 7 transmembrane receptor (rhodopsin family) 347177 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347179 Ribosomal S3Ae family 347206 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 347207 Protein kinase domain 347218 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 347233 Mitochondrial carrier protein 347235 Arginosuccinate synthase. This family contains a PP-loop motif 347240 Kinesin motor domain 347252 Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande 347263 Microtubule associated protein (MAP65/ASE1 family) 347263 IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation 347263 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 347264 Intermediate filament protein 347265 Intermediate filament protein 347265 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 347287 Cyclophilin type peptidyl-prolyl cis-trans isomerase 347288 Ribosomal protein L44 347292 Ribosomal protein L36e 347295 'Cold-shock' DNA-binding domain 347305 Protein kinase domain 347314 7 transmembrane receptor (rhodopsin family) 347316 7 transmembrane receptor (rhodopsin family) 347318 Ribosomal protein L21e 347319 Protein-tyrosine phosphatase 347333 Intermediate filament protein 347336 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347337 Actin 347344 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 347345 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 347346 Aminotransferase class-III 347347 Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily 347348 Hsp20/alpha crystallin family 347358 YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named 347362 Raf-like Ras-binding domain 347362 Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain 347363 Kinesin motor domain 347365 von Willebrand factor type A domain 347376 Core histone H2A/H2B/H3/H4 347381 Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 347381 Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase 347381 Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains. 347386 NADH dehydrogenase 347393 Ets-domain 347393 Core histone H2A/H2B/H3/H4 347395 Core histone H2A/H2B/H3/H4 347400 DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh 347400 DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds 347403 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 347411 Uncharacterised protein family (UPF0041) 347418 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347419 GrpE 347420 K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r 347421 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 347422 N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The e 347428 TPR Domain 347441 Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins 347442 WD domain, G-beta repeat 347442 Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity 347449 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 347452 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 347454 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 347458 metallopeptidase family M24 347468 7 transmembrane receptor (rhodopsin family) 347480 PWI domain 347492 Rhabdovirus Non-virion protein. Infectious hematopoietic necrosis virus (IHNV) is a member of the family Rhabdoviridae. The non-virion protein (NV) is coded for by one of the six genes of the IHNV genome, but is absent in vesiculovirus -like rhabdovirus 347504 Proteasome A-type and B-type 347512 HSF-type DNA-binding 347516 Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT) 347517 ADP-ribosylation factor family 347517 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 347527 Sulfatase 347530 pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. 347530 Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi 347541 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 347542 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 347544 Ribosomal L18ae protein family 347556 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 347572 WAP-type (Whey Acidic Protein) 'four-disulfide core' 347573 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 347582 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 347583 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347598 Trypsin 347598 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347604 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347617 ADP-ribosylation factor family 347617 7 transmembrane receptor (rhodopsin family) 347624 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 347628 Ribosomal protein S2 347632 7 transmembrane receptor (rhodopsin family) 347634 7 transmembrane receptor (rhodopsin family) 347635 7 transmembrane receptor (rhodopsin family) 347636 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 347636 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 347639 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 347649 Class I Histocompatibility antigen, domains alpha 1 and 2 347650 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 347653 Ribosomal protein S2 347655 7 transmembrane receptor (rhodopsin family) 347657 7 transmembrane receptor (rhodopsin family) 347659 7 transmembrane receptor (rhodopsin family) 347660 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 347660 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 347661 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 347663 Zinc finger, C3HC4 type (RING finger) 347665 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 347670 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 347722 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 347722 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 347722 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 347722 Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 347722 Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs 347722 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 347722 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 347731 Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 347732 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 347735 TMS membrane protein/tumour differentially expressed protein (TDE) 347736 Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise 347741 Protein of unknown function, DUF270 347741 Protein of unknown function, DUF270 347754 Homeobox domain 347755 Homeobox domain 347756 Homeobox domain 347757 Homeobox domain 347761 Ribosomal protein S7e 347765 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 347766 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 347773 CUB domain 347777 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 347779 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 347784 Homeobox domain 347785 Homeobox domain 347787 Homeobox domain 347788 Homeobox domain 347789 Homeobox domain 347790 Homeobox domain 347791 Homeobox domain 347792 Homeobox domain 347796 Intermediate filament protein 347797 Actin 347798 ATP synthase subunit C 347799 Cytochrome C and Quinol oxidase polypeptide I 347799 Cytochrome C oxidase subunit II, transmembrane domain. The N-terminal domain of cytochrome C oxidase contains two transmembrane alpha-helices 347800 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 347801 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 347808 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 347809 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347812 Uncharacterised protein family (UPF0100). This family of proteins are periplasmic binding proteins that includes ModA the molybdate-binding protein. Some members of this family are in the UPF0100 family 347812 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 347813 Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP 347822 7 transmembrane receptor (rhodopsin family) 347827 7 transmembrane receptor (rhodopsin family) 347828 Eukaryotic porin 347835 7 transmembrane receptor (rhodopsin family) 347837 Intermediate filament protein 347840 Ribosomal protein S27 347844 7 transmembrane receptor (rhodopsin family) 347845 7 transmembrane receptor (rhodopsin family) 347849 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 347853 T-box. The T-box encodes a 180 amino acid domain that binds to DNA 347856 Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch 347860 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 347862 DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117 347867 Zinc finger, C3HC4 type (RING finger) 347868 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 347869 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 347870 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 347871 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 347873 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347875 Ribosomal protein L44 347876 Serum amyloid A protein 347878 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 347885 7 transmembrane receptor (rhodopsin family) 347886 7 transmembrane receptor (rhodopsin family) 347890 7 transmembrane receptor (rhodopsin family) 347891 7 transmembrane receptor (rhodopsin family) 347892 7 transmembrane receptor (rhodopsin family) 347894 Ribosomal Proteins L2, C-terminal domain 347897 Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle 347898 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 347904 SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members 347904 SCO1/SenC. This family is involved in biogenesis of respiratory and photosynthetic systems. SCO1 is required for a post-translational step in the accumulation of subunits COXI and COXII of cytochrome c oxidase. SenC is required for optimal cytochrome c ox 347907 Intermediate filament protein 347908 Intermediate filament protein 347911 7 transmembrane receptor (rhodopsin family) 347912 7 transmembrane receptor (rhodopsin family) 347914 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 347915 14-3-3 protein 347921 Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vps 347922 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 347925 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 347930 Helix-loop-helix DNA-binding domain 347931 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 347936 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 347936 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 347937 Intermediate filament protein 347937 BTG1 family. A novel family of anti-proliferative proteins 347937 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region 347938 Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family 347941 Ribosomal protein L19e 347942 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 347944 Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1 347947 Domain of unknown function (DUF341) 347947 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 347947 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 347948 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 347960 Ribosomal protein L35Ae 347961 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 347962 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 347966 Galactoside-binding lectin 347967 Core histone H2A/H2B/H3/H4 347969 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 347970 DNA directed RNA polymerase, 7 kDa subunit 347972 Mitochondrial carrier protein 347974 ADP-ribosylation factor family 347975 Homeobox domain 347986 7 transmembrane receptor (rhodopsin family) 347987 7 transmembrane receptor (rhodopsin family) 347988 Ribosomal protein L13 347988 7 transmembrane receptor (rhodopsin family) 347992 Ribosomal L15 347994 ADP-ribosylation factor family 348000 Class II histocompatibility antigen, alpha domain 348001 7 transmembrane receptor (rhodopsin family) 348002 7 transmembrane receptor (rhodopsin family) 348003 Ribosomal protein L13 348003 7 transmembrane receptor (rhodopsin family) 348005 Zinc finger, C3HC4 type (RING finger) 348008 Ribosomal L15 348016 Gamma-glutamyltranspeptidase 348019 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 348026 Ribosomal protein S21e 348028 7 transmembrane receptor (rhodopsin family) 348029 7 transmembrane receptor (rhodopsin family) 348030 7 transmembrane receptor (rhodopsin family) 348031 7 transmembrane receptor (rhodopsin family) 348032 7 transmembrane receptor (rhodopsin family) 348033 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 348034 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 348036 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 348037 Intermediate filament protein 348038 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348039 Eukaryotic ribosomal protein L18 348040 PH domain. PH stands for pleckstrin homology 348042 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 348045 Ribosomal protein S8 348048 Ribosomal protein L24e 348050 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 348052 Cytochrome b(N-terminal)/b6/petB 348053 Ubiquitin carboxyl-terminal hydrolase family 2 348054 Ribosomal protein L21e 348055 Ribosomal protein L21e 348071 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 348074 Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi 348076 Actin 348077 Dynamin GTPase effector domain 348078 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 348078 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 348079 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 348080 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 348081 Ribosomal protein S19 348085 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 348087 S-adenosyl-L-homocysteine hydrolase 348092 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 348093 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 348094 Death domain 348094 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 348095 BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p 348095 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. 348100 Dynamin GTPase effector domain 348101 Intermediate filament protein 348104 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 348105 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 348111 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 348114 Disintegrin 348114 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not 348146 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 348148 Cyclic nucleotide-binding domain 348148 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 348150 ATP synthase A chain 348150 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 348158 AMP-binding enzyme 348159 AMP-binding enzyme 348163 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 348163 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 348163 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 348174 Lectin C-type domain. This family includes both long and short form C-type 348174 SCP-like extracellular protein. This domain is also found in prokaryotes 348178 Actin 348180 Uncharacterized protein family UPF0021 348186 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 348189 Kinesin motor domain 348191 Lectin C-type domain. This family includes both long and short form C-type 348200 Poly-adenylate binding protein, unique domain 348201 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 348205 C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system 348209 Sodium:neurotransmitter symporter family 348214 Helix-loop-helix DNA-binding domain 348215 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 348217 Zinc finger, C3HC4 type (RING finger) 348224 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 348229 Intermediate filament protein 348243 Galactoside-binding lectin 348245 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 348248 Ribosomal L39 protein 348250 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 348251 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 348251 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 348257 TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases 348258 bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170 348263 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 348267 Ribosomal protein L6e 348267 Ribosomal protein L6, N-terminal domain 348269 Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions 348277 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 348278 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 348280 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 348282 Ribosomal protein S8 348283 MCM2/3/5 family 348285 Ribosomal protein L11, RNA binding domain 348286 Bacterial extracellular solute-binding protein, family 7. This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein 348289 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 348292 Ribosomal protein L10 348296 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 348297 Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity 348321 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 348324 Ribosomal protein L13e 348327 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 348327 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 348327 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 348330 Ribosomal protein S5, N-terminal domain 348331 Trypsin 348337 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 348339 7 transmembrane receptor (rhodopsin family) 348340 Ribosomal family S4e 348346 7 transmembrane receptor (rhodopsin family) 348347 Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase 348349 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 348350 YjeF-related protein N-terminus 348354 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 348360 P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function 348362 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348364 Ribosomal protein L21e 348372 Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of 348375 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 348376 Ribosomal protein L10 348380 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 348385 Intermediate filament protein 348394 Ribosomal protein L13e 348395 Cytochrome c oxidase subunit Vb 348402 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 348402 Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa 348415 Eukaryotic porin 348419 Hsp90 protein 348419 Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90 348420 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 348421 Ribosomal protein S12 348423 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 348424 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 348429 Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP 348434 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 348435 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 348435 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 348436 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 348440 Cytochrome C and Quinol oxidase polypeptide I 348441 Ribosomal protein S17 348447 Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical 348449 7 transmembrane receptor (rhodopsin family) 348451 Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This 348453 7 transmembrane receptor (metabotropic glutamate family) 348453 Cell cycle protein. This entry includes the following members 348453 Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure 348459 Dynein light chain type 1 348463 Actin 348464 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 348467 Cytochrome c oxidase subunit III 348471 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348472 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348477 Intermediate filament protein 348477 Intermediate filament protein 348478 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 348478 Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone 348479 Zinc finger, C3HC4 type (RING finger) 348512 Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic 348516 7 transmembrane receptor (rhodopsin family) 348517 7 transmembrane receptor (rhodopsin family) 348518 7 transmembrane receptor (rhodopsin family) 348519 7 transmembrane receptor (rhodopsin family) 348526 Octicosapeptide repeat. Short motif that may bind Ca2+ 348539 Intermediate filament protein 348546 Ribosomal protein S16 348548 Ribosomal protein L19e 348550 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 348556 Ribosomal L15 348563 Ribosomal protein S5, C-terminal domain 348563 Ribosomal protein S5, N-terminal domain 348566 7 transmembrane receptor (rhodopsin family) 348570 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348573 Translationally controlled tumor protein 348589 SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains 348590 Calcium-activated potassium channel, beta subunit 348598 Gamma-glutamyltranspeptidase 348600 Phosphatidylinositol 3- and 4-kinase 348602 Gamma-glutamyltranspeptidase 348607 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 348618 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 348619 ADP-ribosylation factor family 348619 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 348621 Ribosomal S3Ae family 348628 Cytochrome c oxidase subunit III 348633 Actin 348653 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 348662 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 348664 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 348668 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 348670 S-adenosyl-L-homocysteine hydrolase 348675 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 348677 Sulfotransferase protein 348678 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 348679 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 348679 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 348688 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348691 Ribosomal protein L21e 348693 Citrate synthase 348695 Ribosomal protein S2 348698 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 348705 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 348720 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 348722 Poly-adenylate binding protein, unique domain 348726 metallopeptidase family M24 348729 Ribosomal protein L21e 348731 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 348732 NADH dehydrogenase 348733 NADH-ubiquinone/plastoquinone oxidoreductase chain 4L 348733 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 348737 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 348742 TT viral orf 1. TT virus (TTV), isolated initially from a Japanese patient with hepatitis of unknown aetiology, has since been found to infect both healthy and diseased individuals and numerous prevalence studies have raised questions about its role in un 348745 Ribosomal protein L21e 348746 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 348750 AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt 348752 Cytochrome b(C-terminal)/b6/petD 348752 Cytochrome C and Quinol oxidase polypeptide I 348753 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348754 Ribosomal protein L6 348755 Ribosomal protein S27 348756 Ribosomal protein L31e 348757 HMG (high mobility group) box 348769 PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains 348772 Ribosomal protein L21e 348774 HMG (high mobility group) box 348774 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 348775 G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines 348778 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 348787 Cytochrome C and Quinol oxidase polypeptide I 348791 FAD binding domain. This family includes members that bind FAD. This family includes the flavoprotein subunits from succinate and fumarate dehydrogenase, aspartate oxidase and the alpha subunit of adenylylsulfate reductase 348793 WD domain, G-beta repeat 348794 Ribosomal protein L36e 348796 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 348800 7 transmembrane receptor (rhodopsin family) 348806 Ribosomal L10 348810 Translationally controlled tumor protein 348811 HMG (high mobility group) box 348813 Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid 348814 Elongation factor 1 gamma, conserved domain 348814 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 348819 Insulinase (Peptidase family M16) 348822 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 348823 Ribosomal L39 protein 348826 Cytochrome c oxidase subunit III 348826 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 348828 Actin 348829 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 348832 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 348838 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348841 Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte 348841 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 348844 Gastrin/cholecystokinin family 348845 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 348847 Ribosomal protein L6e 348848 Tetraspanin family 348852 Ribosomal protein L6e 348852 Ribosomal protein L6, N-terminal domain 348856 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 348860 Transcription initiation factor TFIID 23-30kDa subunit 348863 Intermediate filament protein 348867 Ribosomal S3Ae family 348868 Ribosomal protein S11 348869 AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4- 348870 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 348871 Cytochrome C and Quinol oxidase polypeptide I 348871 Cytochrome C oxidase subunit II, periplasmic domain 348872 HMG14 and HMG17 348873 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 348879 Ribosomal protein L19e 348884 Ribosomal protein L6e 348885 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 348889 Ribosomal L22e protein family 348895 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 348901 Ubiquitin carboxyl-terminal hydrolase family 2 348901 Ubiquitin carboxyl-terminal hydrolases family 2 348902 Ubiquitin carboxyl-terminal hydrolase family 2 348902 Ubiquitin carboxyl-terminal hydrolases family 2 348903 Ubiquitin carboxyl-terminal hydrolase family 2 348903 Ubiquitin carboxyl-terminal hydrolases family 2 348904 Ubiquitin carboxyl-terminal hydrolase family 2 348904 Ubiquitin carboxyl-terminal hydrolases family 2 348905 Ubiquitin carboxyl-terminal hydrolase family 2 348905 Ubiquitin carboxyl-terminal hydrolases family 2 348906 Ubiquitin carboxyl-terminal hydrolase family 2 348906 Ubiquitin carboxyl-terminal hydrolases family 2 348907 Ubiquitin carboxyl-terminal hydrolases family 2 348908 Ubiquitin carboxyl-terminal hydrolase family 2 348908 Ubiquitin carboxyl-terminal hydrolases family 2 348909 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 348911 Herpes virus major capsid protein. This family represents the major capsid protein (MCP) of herpes viruses. The capsid shell consists of 150 MCP hexamers and 12 MCP pentamers. One pentamer is found at each of the 12 apices of the icosahedral shell, and th 348916 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 348917 Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play 348920 UDP-glucoronosyl and UDP-glucosyl transferase 348921 UDP-glucoronosyl and UDP-glucosyl transferase 348922 UDP-glucoronosyl and UDP-glucosyl transferase 348923 UDP-glucoronosyl and UDP-glucosyl transferase 348932 Sodium:neurotransmitter symporter family 348934 Ribosomal protein S12 348948 Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa 348949 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 348949 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 348951 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 348952 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 348953 HMG (high mobility group) box 348955 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 348956 Ribosomal protein L21e 348961 Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the C 348963 Ribosomal protein S19e 348965 Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases 348966 Adenylate kinase 348967 Ribosomal protein L31e 348975 Core histone H2A/H2B/H3/H4 348978 CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP 348980 Cyclic nucleotide-binding domain 348980 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 348980 Cyclic nucleotide-binding domain 348980 Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is 348982 Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase 348983 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 348984 Ribosomal L10 348987 Cyclophilin type peptidyl-prolyl cis-trans isomerase 348996 Cytochrome C oxidase subunit II, periplasmic domain 348996 Cytochrome C oxidase subunit II, transmembrane domain. The N-terminal domain of cytochrome C oxidase contains two transmembrane alpha-helices 349000 AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA) 349001 Ribosomal protein S2 349003 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 349004 Ribosomal protein L21e 349004 EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains 349005 Ribosomal L15 349008 Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 349011 S25 ribosomal protein 349013 ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes 349014 Ribosomal protein S26e 349017 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 349018 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 349019 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 349020 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 349021 Ribosomal protein L21e 349025 Ribosomal protein S11 349029 Translationally controlled tumor protein 349029 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 349033 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 349033 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 349035 Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities 349036 Ribosomal L10 349046 Intermediate filament protein 349049 Ubiquitin carboxyl-terminal hydrolase family 2 349052 Ribosomal protein S2 349060 Bacterial regulatory helix-turn-helix protein, lysR family 349060 LysR substrate binding domain. The structure of this domain is known and is similar to the periplasmic binding proteins 349061 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 349065 Protein kinase domain 349066 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 349069 von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094 349073 EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi 349075 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 349083 Ribosomal protein L21e 349084 Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal 349088 Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after) 349089 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 349105 Ribosomal protein L6e 349109 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 349121 NADH dehydrogenase 349136 WD domain, G-beta repeat 349139 Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon 349141 UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to 349142 GrpE 349143 ADP-ribosylation factor family 349143 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 349146 Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats 349149 Connexin 349149 Connexin 349154 Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but func 349158 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 349173 Ribosomal protein L24e 349177 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 349179 Minor capsid protein VI. This minor capsid protein may act as a link between the external capsid and the internal DNA-protein core. The C-terminal 11 residues may function as a protease cofactor leading to enzyme activation 349182 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 349188 7 transmembrane receptor (rhodopsin family) 349189 Ubiquitin carboxyl-terminal hydrolase family 2 349197 Aminotransferase class I and II 349198 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 349201 Ribosomal protein L44 349203 Ribosomal protein S26e 349206 Pou domain - N-terminal to homeobox domain 349209 Ribosomal protein S2 349210 Eukaryotic ribosomal protein L18 349222 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 349223 Cyclophilin type peptidyl-prolyl cis-trans isomerase 349224 Ribosomal protein L13e 349227 Intermediate filament protein 349230 Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel 349231 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 349233 Cyclic nucleotide-binding domain 349237 Synaptogyrin. This family of proteins is distantly related to pfam01284 349246 Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins 349253 ATP synthase alpha/beta chain, C terminal domain 349253 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 349253 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 349255 7 transmembrane receptor (metabotropic glutamate family) 349258 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 349259 ATP synthase alpha/beta chain, C terminal domain 349259 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 349261 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling. 349264 ATP synthase alpha/beta chain, C terminal domain 349264 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 349264 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 349266 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 349267 ATP synthase alpha/beta chain, C terminal domain 349267 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 349280 Calponin family repeat 349280 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 349284 Calponin family repeat 349284 Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no 349285 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 349288 7 transmembrane receptor (Secretin family) 349288 Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin 349289 Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling. 349293 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 349294 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 349304 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 349311 pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II 349314 Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol 349318 Fork head domain 349320 ATP synthase alpha/beta chain, C terminal domain 349320 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 349325 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 349326 Protein kinase domain 349326 pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess 349331 Protein of unknown function DUF82. This prokaryotic protein family has no known function. The protein contains four conserved cysteines that may be involved in metal binding or disulphide bridges 349334 Fork head domain 349335 Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans 349345 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 349356 Actin 349363 ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella 349368 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 349369 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 349370 Homeobox domain 349371 PMP-22/EMP/MP20/Claudin family 349372 Homeobox domain 349373 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 349373 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 349374 C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes 349375 Pancreatic hormone peptide 349376 NADH dehydrogenase 349380 Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel 349380 Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot 349384 Ribosomal L18ae protein family 349385 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 349386 Homeobox domain 349395 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 349397 Lysosome-associated membrane glycoprotein (Lamp) 349398 LIM domain. This family represents two copies of the LIM structural domain 349400 Ets-domain 349400 Sterile alpha motif (SAM)/Pointed domain 349404 Protein kinase domain 349405 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 349406 Ribosomal protein S2 349406 Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is 349409 ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown 349411 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 349412 Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes 349413 Cation-dependent mannose-6-phosphate receptor 349414 Cyclophilin type peptidyl-prolyl cis-trans isomerase 349415 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 349418 WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro 349419 7 transmembrane receptor (rhodopsin family) 349424 AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt 349426 Glu/Leu/Phe/Val dehydrogenase, dimerisation domain 349426 Glutamate/Leucine/Phenylalanine/Valine dehydrogenase 349434 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b 349436 LIM domain. This family represents two copies of the LIM structural domain 349439 SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin 349441 Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues 349443 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 349444 F-actin capping protein alpha subunit 349446 Sulfatase 349448 Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial 349450 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 349479 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 349509 Homeobox domain 349519 Glu/Leu/Phe/Val dehydrogenase, dimerisation domain 349519 Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region 349550 7 transmembrane receptor (rhodopsin family) 349556 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 349557 CUB domain 349589 Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure 349599 Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7 349603 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 349611 Cyclophilin type peptidyl-prolyl cis-trans isomerase 349624 ICE-like protease (caspase) p10 domain 349624 ICE-like protease (caspase) p20 domain 349625 ICE-like protease (caspase) p10 domain 349625 ICE-like protease (caspase) p20 domain 349632 Ribosomal S17 349636 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 349642 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 349642 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 349646 Glutamine synthetase, catalytic domain 349646 Glutamine synthetase, beta-Grasp domain 349648 Intermediate filament protein 349664 7 transmembrane receptor (rhodopsin family) 349666 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 349668 Ribosomal family S4e 349669 Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans 349673 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 349675 Ribosomal S3Ae family 349677 7 transmembrane receptor (rhodopsin family) 349698 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 349712 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 349730 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 349734 Ribosomal protein L44 349741 Ribosomal protein L34e 349758 RanBP1 domain 349772 Ribosomal protein L21e 349774 Anaphase-promoting complex, subunit 10 (APC10) 349781 Ribosomal protein L31e 349792 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 349796 Cyclophilin type peptidyl-prolyl cis-trans isomerase 349803 Ribosomal protein S17 349814 Homeobox domain 349815 NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold 349815 6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each 349816 Ribosomal L10 349818 Nucleolar RNA-binding protein, Nop10p family. Nop10p is a nucleolar protein that is specifically associated with H/ACA snoRNAs. It is essential for normal 18S rRNA production and rRNA pseudouridylation by the ribonucleoprotein particles containing H/ACA s 349820 Ribosomal protein S6e 349824 Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer 349831 Ribosomal protein S6e 349842 HMG14 and HMG17 349842 G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase 349848 Ribosomal L39 protein 349861 Homeobox domain 349869 Chaperonin 10 Kd subunit 349875 Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma 349881 Ribosomal protein L11, RNA binding domain 349881 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 349884 GrpE 349886 Intermediate filament protein 349892 Hsp90 protein 349898 Ribosomal protein S6e 349899 7 transmembrane receptor (rhodopsin family) 349907 ATP synthase subunit C 349933 Eukaryotic porin 349948 pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium 349954 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 349954 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 349957 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 349960 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 349963 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 349963 DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA 349974 Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding 349996 Nucleoside diphosphate kinase 349997 NADH-ubiquinone/plastoquinone oxidoreductase, chain 3 350003 Protein-tyrosine phosphatase 350005 WD domain, G-beta repeat 350006 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 350009 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 350010 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 350012 Ribosomal L39 protein 350018 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 350042 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 350042 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 350048 Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc 350052 Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib 350053 Ribosomal protein L21e 350070 Ribosomal L22e protein family 350077 Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter 350082 Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich 350104 Ribosomal protein S2 350121 Dynamin GTPase effector domain 350125 Dynamin GTPase effector domain 350127 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 350131 Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins 350144 Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch 350154 Ribosomal protein S15 350156 Ribosomal protein L21e 350156 SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation 350178 HMG14 and HMG17 350203 Poly-adenylate binding protein, unique domain 350223 7 transmembrane receptor (rhodopsin family) 350244 NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase 350264 Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t 350292 Cyclophilin type peptidyl-prolyl cis-trans isomerase 350330 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 350351 ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte 350354 Protein kinase domain 350363 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 350378 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 350383 7 transmembrane receptor (rhodopsin family) 350383 7 transmembrane receptor (rhodopsin family) 350393 Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour 350426 SH2 domain 350431 Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine 350451 C2 domain 350455 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 350457 Inward rectifier potassium channel 350462 Lipoxygenase 350485 Translation initiation factor SUI1 350496 pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc 350515 Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase 350543 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 350568 Ribosomal protein S2 350588 Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1 350601 D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389 350604 Galactoside-binding lectin 350614 7 transmembrane receptor (metabotropic glutamate family) 350616 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 350616 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 350617 7 transmembrane receptor (rhodopsin family) 350623 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 350628 7 transmembrane receptor (rhodopsin family) 350631 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 350632 Ribosomal L39 protein 350634 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 350636 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 350638 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 350639 Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient 350641 Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w 350642 Ribosomal protein S19e 350652 RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers 350671 Ribosomal protein S24e 350678 Ribosomal L39 protein 350686 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 350689 Cytochrome C and Quinol oxidase polypeptide I 350693 7 transmembrane receptor (rhodopsin family) 350694 Core histone H2A/H2B/H3/H4 350694 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 350695 Core histone H2A/H2B/H3/H4 350696 Core histone H2A/H2B/H3/H4 350701 Elongation factor 1 gamma, conserved domain 350704 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 350718 7 transmembrane receptor (rhodopsin family) 350721 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 350723 Cytochrome c oxidase subunit III 350744 Lectin C-type domain. This family includes both long and short form C-type 350746 Ribosomal protein S26e 350747 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 350766 Ribosomal protein S5, N-terminal domain 350767 CUB domain 350778 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes 350782 HesB-like domain. This family includes HesB which may be involved in nitrogen fixation 350792 7 transmembrane receptor (rhodopsin family) 350793 Pentaxin family. Pentaxins are also known as pentraxins 350799 Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase 350802 Amidase 350806 Ribosomal protein S5, C-terminal domain 350806 Ribosomal protein S5, N-terminal domain 350815 NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions 350815 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor 350816 FKBP-type peptidyl-prolyl cis-trans isomerase 350818 Ribosomal L39 protein 350822 Ribosomal protein L6e 350843 7 transmembrane receptor (rhodopsin family) 350864 Calpain family cysteine protease 350864 Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic 350868 Protein kinase domain 350871 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 350878 Ribosomal protein S7e 350893 7 transmembrane receptor (rhodopsin family) 350921 Ribosomal protein L13e 350955 Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi 350965 Homeobox domain 350982 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 350997 Ribosomal Proteins L2, C-terminal domain 351001 SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma 351022 Herpes virus major capsid protein. This family represents the major capsid protein (MCP) of herpes viruses. The capsid shell consists of 150 MCP hexamers and 12 MCP pentamers. One pentamer is found at each of the 12 apices of the icosahedral shell, and th 351040 LIM domain. This family represents two copies of the LIM structural domain 351047 Uncharacterised protein family (UPF0184) 351049 Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors 351076 RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases 351098 NADH dehydrogenase 351145 Ribosomal protein L21e 351149 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 351149 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 351150 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 351152 Ribosomal protein L11, RNA binding domain 351152 Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain 351178 ribosomal L5P family C-terminus. This region is found associated with pfam00281 351195 WD domain, G-beta repeat 351207 Glutamine amidotransferases class-II 351209 Zinc finger, C3HC4 type (RING finger) 351209 7 transmembrane receptor (rhodopsin family) 351222 Ribosomal protein S21e 351223 Ribosomal L22e protein family 351225 7 transmembrane receptor (rhodopsin family) 351227 Actin 351240 Ribosomal L15 351253 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 351259 Cytochrome C and Quinol oxidase polypeptide I 351259 Cytochrome C oxidase subunit II, periplasmic domain 351273 Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of 351281 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 351298 Ribosomal protein L23 351320 Ribosomal protein S2 351323 Cyclophilin type peptidyl-prolyl cis-trans isomerase 351332 IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated 351348 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 351351 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 351383 7 transmembrane receptor (rhodopsin family) 351390 Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins 351391 Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt 351421 Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate 351425 Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein 351458 7 transmembrane receptor (rhodopsin family) 351465 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 351471 Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes 351494 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 351507 ATP synthase subunit C 351508 Actin 351515 Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria 351525 Intermediate filament protein 351532 Ribosomal protein L23 351567 Fructose-bisphosphate aldolase class-I 351575 HIT family 351590 Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind 351593 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 351596 Protein kinase domain 351597 Furin-like cysteine rich region 351597 Giardia variant-specific surface protein 351600 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 351607 Ribosomal protein L31e 351638 Elongation factor 1 gamma, conserved domain 351638 Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 351638 Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f 351642 Ribosomal protein S5, C-terminal domain 351642 Ribosomal protein S5, N-terminal domain 351655 Adenylate kinase 351672 Zinc finger, C3HC4 type (RING finger) 351672 SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues 351677 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 351682 Trypsin 351726 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 351732 Ribosomal S3Ae family 351746 Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea 351753 NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane 351780 Glutamine amidotransferases class-II 351799 Cytochrome C and Quinol oxidase polypeptide I 351811 Cytochrome b(C-terminal)/b6/petD 351813 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 351814 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 351819 Actin 351835 Ribosomal L10 351835 Ribosomal protein S2 351853 NADH-ubiquinone/plastoquinone oxidoreductase chain 6 351864 BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown 351864 RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains 351864 SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel 351870 Ribosomal protein L6e 351883 Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes 351890 Core histone H2A/H2B/H3/H4 351899 Ribosomal L28e protein family 351911 Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10 351942 Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin 351942 Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group 351943 Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde 351949 Ribosomal family S4e 351951 Intermediate filament protein 351953 Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain 351953 pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain 351961 Actin 351967 Ribosomal protein L13 351969 Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14 351984 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 352006 Intermediate filament protein 352012 Mitochondrial carrier protein 352013 Ribosomal protein L23 352019 Ribosomal family S4e 352027 Ribosomal protein S2 352029 Ribosomal S3Ae family 352037 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 352038 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 352054 Ribosomal protein L6 352054 Ribosomal protein S7e 352097 Spumavirus gag protein 352097 Virulence determinant. The UK protein is an African swine fever virus (ASFV) protein that is highly conserved amongst strains, and is an important viral virulence determinant for domestic pigs 352103 Ribosomal family S4e 352108 Ribosomal protein L6e 352118 Core histone H2A/H2B/H3/H4 352119 Ribosomal Proteins L2, C-terminal domain 352124 Ribosomal L15 352145 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 352146 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 352152 Ribosomal protein L31e 352156 GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron 352157 Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays 352158 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 352160 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 352161 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 352162 Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS) 352170 AMP-binding enzyme 352172 lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member 352172 lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f 352193 Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families 352196 KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia 352230 Cytochrome C and Quinol oxidase polypeptide I 352230 Cytochrome C oxidase subunit II, periplasmic domain 352236 pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain 352266 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka 352272 Ribosomal protein S27 352278 Mitochondrial carrier protein 352327 SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase 352327 AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this 352347 Dynein light chain type 1 352353 Ribosomal S3Ae family 352368 Arginosuccinate synthase. This family contains a PP-loop motif 352391 Ribosomal S17 352391 Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation 352392 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes 352393 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 352400 DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont 352428 Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p 352436 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 352454 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 352455 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 352455 Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be 352457 7 transmembrane receptor (rhodopsin family) 352459 Ribosomal L10 352462 Ubiquitin carboxyl-terminal hydrolase family 2 352462 Ubiquitin carboxyl-terminal hydrolases family 2 352463 Defensin propeptide 352464 Defensin propeptide 352464 Mammalian defensin 352465 Ribosomal S3Ae family 352466 7 transmembrane receptor (rhodopsin family) 352467 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 352468 TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family 352473 Ribosomal protein L6e 352492 Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold 352492 Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold 352507 Core histone H2A/H2B/H3/H4 352524 Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in 352559 Protein kinase domain 352582 Ribosomal protein L13 352617 Fructose-1-6-bisphosphatase 352618 Papain family cysteine protease 352618 Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases 352625 F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores 352626 6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. T 352628 FKBP-type peptidyl-prolyl cis-trans isomerase 352629 6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. T 352631 Skp1 family, dimerisation domain 352631 Skp1 family, tetramerisation domain 352632 Skp1 family, dimerisation domain 352632 Skp1 family, tetramerisation domain 352633 FKBP-type peptidyl-prolyl cis-trans isomerase 352636 6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. T 352637 FKBP-type peptidyl-prolyl cis-trans isomerase 352638 Skp1 family, dimerisation domain 352638 Skp1 family, tetramerisation domain 352640 FKBP-type peptidyl-prolyl cis-trans isomerase 352642 7 transmembrane receptor (metabotropic glutamate family) 352645 KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. 352656 Zinc finger, C3HC4 type (RING finger) 352659 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 352662 Zinc finger, C3HC4 type (RING finger) 352662 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 352663 FKBP-type peptidyl-prolyl cis-trans isomerase 352664 7 transmembrane receptor (metabotropic glutamate family) 352669 Transcription factor S-II (TFIIS) 352670 Zinc finger, C3HC4 type (RING finger) 352670 Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve 352671 GDA1/CD39 (nucleoside phosphatase) family 352671 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin 352689 Cyclophilin type peptidyl-prolyl cis-trans isomerase 352699 Interferon alpha/beta domain 352700 Interferon alpha/beta domain 352733 Nucleoside diphosphate kinase 352746 Ribosomal protein L31e 352788 Ribosomal protein L19e 352792 Aminotransferase class I and II 352793 MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t 352807 Ribosomal protein L19e 352814 Ribosomal protein L35Ae 352832 Ribosomal S17 352837 Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks 352840 Cytochrome b(N-terminal)/b6/petB 352852 Cytochrome c. The cytochrome 556 and cytochrome c' families are not included 352861 Sodium:neurotransmitter symporter family 352864 ADP-ribosylation factor family 352871 D12 class N6 adenine-specific DNA methyltransferase 352879 Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis 352882 E1-E2 ATPase 352882 haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw 352893 Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility 352896 Ocular albinism type 1 protein 352897 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ 353155 Connexin 353169 Sugar (and other) transporter 353172 WHEP-TRS domain 353174 pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure 353187 Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors 353187 Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other 353188 Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo 353188 Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no 353189 Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 353189 Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution 353190 YjeF-related protein N-terminus 353204 Fructose-bisphosphate aldolase class-I 353218 TNF(Tumor Necrosis Factor) family 353233 Beige/BEACH domain 353250 7 transmembrane receptor (rhodopsin family) 353252 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 353258 Low temperature viability protein 353281 Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is 353288 Intermediate filament protein 353307 Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their 353328 von Willebrand factor type D domain 353328 Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that 353344 7 transmembrane receptor (rhodopsin family) 353345 7 transmembrane receptor (rhodopsin family) 353346 7 transmembrane receptor (rhodopsin family) 353376 emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains 353498 Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor 353500 TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein 359725 short chain dehydrogenase. This family contains a wide variety of dehydrogenases 359726 Protein kinase domain 360034 Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas 360202 Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits